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Novel Human ^2 Integrin Alpha Subunit 

This application is a continuation-in-part of U.S. Application Serial 
No. 08/286,889, filed August 5, 1994, which is pending, which in turn is a 
continuation-in-part of U.S. Application Serial No. 08/173,497, filed December 
5 23, 1993, which is pending. 

Field of the Invention 

The present invention relates to the cloning and expression of 
polynucleotides encoding a novel human 02 integrin a subunit, designated a^, 
which is structurally related to the known human p2 integrin a subunits, CDlla, 
0 CDl lb and CDl Ic. The present invention also relates to polynucleotides isolated 

from other species which show homology to human encoding sequences. 

Background of the Invention 

The integrins are a class of membrane-associated molecules which 
actively participate in cellular adhesion. Integrins are transmembrane 
5 heterodimers comprising an of subunit in noncovalent association with a 0 subunit. 

To date, at least fourteen a subunits and eight 0 subunits have been identified 
[reviewed in Springer, Nature 346:425-434 (1990)]. The /8 subunits are generally 
capable of association with more than one a subunit and the heterodimers sharing 
a common 0 subunit have been classified as subfamilies within the integrin 
20 population. 

One class of human integrins, restricted to expression in white 
blood cells, is characterized by a common /S2 subunit. As a result of this cell- 
specific expression, these integrins are commonly referred to as the leukocyte 
integrins, Leu-CAMs or leukointegrins. Because of the common ^2 subunit, an 
25 alternative designation of this class is the /S2 integrins. The ^2 subunit (CD18) 

has previously been isolated in association with one of three distinct a subunits, 
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CDlla, CDllb or CDllc. The isolation of a cDNA encoding human CD18 is 
described in Kishimoto, et al. Cell 48:681-690 (1987). In official AVHO 
nomenclature, the heterodimeric proteins are referred to as CDlla/CD18, 
CDllb/CD18, and CDllc/CD18; in common nomenclature they are referred to 
5 as LFA-1, Mac-1 or Mol and pl50,95 or LeuM5, respectively [Cobbold, et aL, 

in Leukocyte Typing III, McMichael (ed), Oxford Press, p.788 (1987)], The 
human ^2 integrin a subunits CDlla, CDllb and CDllc have been demonstrated 
to migrate under reducing condition in electrophoresis with apparent molecular 
weights of approximately 180 kD, 155 kD and 150 kD, respectively, and DNAs 

10 encoding these subunits have been cloned [CDlla, Larson, et aL, J. Cell Biol. 

708:703-712 (1989); CDl lb, Corbi, et al. . J.BioL Chem. 263: 12403-1241 1 (1988) 
and CDllc, Corbi, et aL EMBO J. 5:4023-4028 (1987)]. Putative homologs of 
the human ^2 integrin ot and jS chains, defined by approximate similarity in 
molecular weight, have been variously identified in other species including 

15 monkeys and other primates [Letvin, et aL, Blood 67:408-410 (1983)], mice 

[Sanchez-Madrid, et aL,J.Exp.Me± 154:1511 (1981)], and dogs [Moore, et al. 
Tissue Antigens 36:211-220 (1990)]. 

The absolute molecular weights of presumed homologs from other 
species have been shown to vary significantly [see, e.g., Danilenko et aL, Tissue 

20 Antigens 40:13-21 (1992)], and in the absence of sequence information, a 

definitive correlation between human integrin subunits and those identified in 
other si>ecies has not been possible. Moreover, variation in the number of 
members in a protein family has been observed between different species. 
Consider, for example, that more IgA isotypes have been isolated in rabbits than 

25 in humans [Burnett, et aL, EMBO J. 5:4041-4047 (1989) and Schneiderman, et 

aL, Proc.NatLAcad.ScLfUSA) S6: 7561-7565 (1989)]. Similarly, in humans, at 
least six variants of the metallothionine protein have been previously identified 
[Karin and Richards, Nature 2PP: 797-802 (1982) and Varshney, et aL, 
MoL CelLBioL 6:26-37, (1986)], whereas in the mouse, only two such variants are 
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in evidence [Searle, et ah. MolCett.Biol ^: 1221-1230 (1984)]. Therefore, 
existence of multiple members of a protein family in one species does not 
necessarily imply that corresponding family members exist in another species. 

In the specific context of ^2 integrins, in dogs it has been observed 
that the presumed canine &2 counterpart to the human CD18 is capable of dimer 
formation with as many as four potentially distinct a subunits [Danilenko, et al, 
supra]. Antibodies generated by immunizing mice with canine splenocytes 
resulted in monoclonal antibodies which immunoprecipitated proteins tentatively 
designated as canine homologs to human CD18, CDlla, CDl lb and CDl Ic based 
mainly on simUar, but not identical, molecular weights. Another anti-canine 
splenocyte antibody, Call,8H2, recognized and immunoprecipitated a fourth a- 
like canine subunit also capable of association with the /Sj subunit, but having a 
unique molecular weight and restricted in expression to a subset of differentiated 
tissue macrophages. Antibodies generated by immunization of hamsters with 
murine dendritic cells resulted in two anti-integrin antibodies [Metlay, et al, 
J.Exp.Med. J7i:1753-1771 (1990)]. One antibody, 2E6, immunoprecipitated a 
predominant heterodimer with subunits having approximate molecular weights of 
180 kD and 90 kD in addition to minor bands in the molecular weight range of 
150-160 kD. The second antibody, N418, precipitated another apparent 
heterodimer with subunits having approximate molecular weights of 150 kD and 
90 Kd. Based on cellular adhesion blocking studies, it was hypothesized that 
antibody 2E6 recognized a murine counterpart to human CDl 8. While the 
molecular weight of the N418 antigen suggested recognition of a murine homolog 
to human CDl.lc/CD18, further analysis indicated that the murine antigen 
exhibited a tissue distribution pattern which was inconsistent with that observed 
for human CDllc/CD18. 

The antigens recognized by the canine Call.8H2 antibody and the 
murine N418 antibody could represent a variant species (e.g., a glycosylation or 
splice variant) of a previously identified canine or murine a subunit. 
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Alternatively, these antigens may represent unique canine and murine integrin a 
subunits. In the absence of specific information regarding primary structure, 
these alternatives cannot be distinguished. 

In humans, CDlla/CD18 is expressed on all leukocytes. 
CDllb/CD18 and CDllc/CD18 are essentially restricted to expression on 
monocytes, granulocytes, macrophages and natural killer (NK) cells, but 
CDllc/CDlS is also detected on some B-cell types. In general, CDlla/CD18 
predominates on lymphocytes, CDllb/CD18 on granulocytes and CDllc/CD18 
on macrophages [see review, Amaout, Blood 75:1037-1050 (1990)]. Expression 
of the a chains, however, is variable with regard to the state of activation and 
differentiation of the individual cell types [See review, Larson and Springer, 
ImmunoLRev. 114:181-217(1990).] 

The involvement of the 02 integrins in human immune and 
inflammatory responses has been demonstrated using monoclonal antibodies which 
15 are capable of blocking integrin-associated cell adhesion. For example, 

CDlla/CD18, CDllb/CD18 and CDllc/CD18 actively participate in natural 
killer (NK) cell binding to lymphoma and adenocarcinoma cells [Patarroyo, et ah , 
ImmunoLRev, 114:67-108 (1990)], granulocyte accumulation [Nourshargh, etaL, 
JJmmunoL 7^2:3193-3198 (1989)], granulocyte-independent plasma leakage 
20 [Arfors, et al, Blood 5P:338-340 (1987)], chemotactic response of stimulated 

leukoc>rtes [Arfors, et aL, supra] and leukocyte adhesion to vascular endothelium 
[Price, et aL, JJmmunoL 139:4174-4177 (1987) and Smith, et aL. J. Clin, Invest. 
SJ:2008-2017 (1989)]. The fundamental role of ^2 integrins in immune and 
inflammatory responses is made apparent in the clinical syndrome referred to as 
25 leukocyte adhesion deficiency (LAD), wherein clinical manifestations include 

recurrent and often life threatening bacterial infections. LAD results from 
heterogeneous mutations in the ^2 subunit [Kishimoto, et aL, Cell 50:193-202 
(1987)] and the severity of the disease state is proportional to the degree of the 
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deficiency in ^2 subunit expression. Formation of the complete integrin 
heterodimer is impaired by the ^2 Ji^utation [Kishimoto, et ah, supra]. 

Interestingly, at least one antibody specific for CD18 has been 
shown to inhibit human immunodeficiency virus type-1 (HTV-l) syncytia 
5 formation in vitrOj albeit the exact mechanism of this inhibition is unclear 
[Hildreth and Orentas, Science 244:1075-1078 (1989)]. This observation is 
consistent with the discovery that a principal counterreceptor of CDlla/CD18, 
ICAM-1, is also a surface receptor for the major group of rhinovirus serotypes 
[Greve, et al. Cell 55:839 (1989)], 

10 The significance of ^2 integrin binding activity in human immune 

and inflammatory responses underscores the necessity to develop a more complete 
understanding of this class of surface proteins. Identification of yet unknown 
members of this subfamily, as well as their counterreceptors, and the generation 
of monoclonal antibodies or other soluble factors which can alter biological 

15 activity of the 02 integrins will provide practical means for therapeutic 

intervention in ^2 integrin-related immune and inflammatory responses. 

Brief Description of the Invention 

In one aspect, the present invention provides novel purified and 
isolated polynucleotides (c.^., DNA and KNA transcripts, both sense and anti- 

20 sense strands) encoding a novel human $2 integrin a subunit, a^, and variants 
thereof (i.e., deletion, addition or substitution analogs) which possess binding 
and/or immunological properties inherent to a^. Preferred DNA molecules of the 
invention include cDNA, genomic DNA and wholly or partially chemically 
synthesized DNA molecules. A presently preferred polynucleotide is the DNA 

25 as set forth in SEQ ID NO: 1, encoding the polypeptide of SEQ ID NO: 2. Also 
provided are recombinant plasmid and viral DNA constructions (expression 
constructs) which include encoding sequences, wherein the encoding 
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sequence is operatively linked to a homologous or heterologous transcriptional 
regulatory element or elements. 

Also provided by the present invention are isolated and purified 
mouse and rat polynucleotides which exhibit homology to polynucleotides 
encoding human a^. A preferred mouse polynucleotide is set forfh in SEQ ID 
NO: 52; a preferred rat polynucleotide is set forth in SEQ ID NO: 54. 

As another aspect of the invention, prokaryotic or eukaryotic host 
cells transformed or transfected with DNA sequences of the invention are 
provided which express polypeptide or variants thereof. Host cells of the 
invention are particularly useful for large scale production of polypeptide, 
which can be isolated from either the host cell itself or from the medium in which 
the host cell is grown. Host cells which express polypeptide on their 
extracellular membrane surface are also useful as immunogens in the production 
of aj-specific antibodies. Preferably, host ceUs transfected with will be co- 
transfected to express a 02 integrin subunit in order to allow surface expression 
of the heterodimer. 

Also provided by the present invention are purified and isolated 
polypeptides, fragments and variants thereof. .Preferred polypeptides are as 
set forth in SEQ ID NO: 2. Novel products of the invention may be obtained 
as isolates from natural sources, but, along with variant products, are 
preferably produced by recombinant procedures involving host cells of the 
invention. Completely glycosylated, partially glycosylated and wholly de- 
glycosylated forms of the polypeptide may be generated by varying the host 
cell selected for recombinant production and/or post-isolation processing. Variant 
"d polypeptides of the invention may comprise water soluble and insoluble 
polypeptides including analogs wherein one or more of the amino acids are 
deleted or replaced: (1) without loss, and preferably with enhancement, of one or 
more biological activities or immunological characteristics specific for a^; or (2) 
with specific disablement of a particular ligand/receptor binding or signalling 
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library. Alternatively, polymerase chain reaction (PGR) using oligonucleotide 
primers that are designed based on the known cDNA sequence can be used to 
amplify and identify genomic DNA sequences. Synthetic DNAs encoding the 
a^j polypeptide, including fragments and other variants thereof, may be produced 
5 by conventional synthesis methods. 

DNA sequence information of the invention also makes possible the 
development, by homologous recombination or "knockout" strategies [see, e.^., 
Kapecchi, Science 244:1288-1292 (1989)], to produce rodents that fail to express 
a functional polypeptide or that express a variant ot^ polypeptide. Such rodents 
10 are useful as models for studying the activities of and modulators in vivo, 

DNA and amino acid sequences of the invention also make possible 
the analysis of epitopes which actively participate in counterreceptor binding 
as well as epitopes which may regulate, rather than actively participate in, 
binding. Identification of epitopes which may participate in transmembrane signal 
15 transduction is also comprehended by the invention. 

DNA of the invention is also useful for the detection of cell types 
which express polypeptide. Standard DNA/RNA hybridi2ation techniques 
which utilize DNA to detect RNA may be used to determine the 
constitutive level of ot^ transcription within a cell, as well as changes in the level 
20 of transcription in response to internal or external agents. Identification of agents 

which modify transcription and/or translation of oi^ can, in turn, be assessed for 
potential therapeutic or prophylactic value. DNA of the invention also makes 
possible in situ hybridization of DNA to cellular RNA to determine the cellular 
localization of oi^ specific messages within complex cell populations and tissues. 
25 DNA of the invention is also useful for identification of non-human 

polynucleotide sequences which display homology to human sequences. 
Possession of non-human a^j DNA sequences permits development of animal 
models (including, for example, transgenic models) of the human system. 
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transfecting appropriate host cells with a DNA construct comprising a reporter 
gene under the control of a promoter regulated by a transcription factor having 
a DNA-binding domain and an activating domain, expressing in the host cells a 
first hybrid DNA sequence encoding a first fusion of part or all of and either 
5 the DNA binding domain or the activating domain of the transcription factor, 

expressing in the host cells a library of second hybrid DNA sequences encoding 
second fusions of part or all of putative binding proteins and the DNA binding 
domain or activating domain of the transcription factor which is not incorporated 
in the first fusion, detecting binding of an binding protein to in a particular 

10 host cell by detecting the production of reporter gene product in the host cell, and 

isolating second hybrid DNA sequences encoding binding protein from the 
particular host cell. 

Hybridoma cell lines which produce antibodies specific for are 
also comprehended by the invention. Techniques for producing hybridomas which 

15 secrete monoclonal antibodies are well known in the art. Hybridoma cell lines 

may be generated after immunizing an animal with purified a^, variants of or 
cells which express a ^ or a variant thereof on the extracellular membrane surface. 
Immunogen cell types include cells which express in vivo, or transfected 
prokaryotic or eukaryotic cell lines which normally do not normally express 

20 in vivo. 

The value of the information contributed through the disclosure of 
the DNA and amino acid sequences of is manifest. In one series of examples, 
the disclosed CDNA sequence makes possible the isolation of the human 
genomic DNA sequence, including transcriptional control elements for the 
25 genomic sequence. Identification of allelic variants and heterologous species 

(e.g., rat or mouse) DNAs is also comprehended. Isolation of the human 
genomic DNA and heterologous species DNAs can be accomplished by standard 
DNA/DNA hybridization techniques, under appropriately stringent conditions, 
using all or part of the cDNA sequence as a probe to screen an appropriate 
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thereof on a solid support coated (or impregnated with) a fluorescent agent, 
labelling the Ugand with a compound capable of exciting the fluorescent agent, 
contacting the immobilized a, with the labelled Ugand in the presence and absence 
of a puutive modulator compound, detecting light emission by the fluorescent 
agem, and identifying modulating compounds as those compounds that affa:t the 
emission of light by the fluorescent agent in comparison to the emission of light 
by the fluorescent agent in the absence of a modulating compound. AltemaUvely . 
the Ugand may be immobiUzed and may be labelled in the assay. 

Yet another method contemplated by the invention for idenUfymg 
compounds that modulate the interaction between a, and a ligand involves 
transforming or transfecting appropriate host cells with a DNA construct 
comprising a reporter gene under the control of a promoter regulated by a 
transcription factor having a DNA-binding domain and an activating domain, 
expressing in the host cells a first hybrid DNA sequence encoding a first fusion 
of part or all of and either the DNA binding domain or the activating domam 
of the transcription factor, expressing in the host cells a second hybrid DNA 
sequence encoding part or all of the ligand and the DNA binding domain or 
activating domain of the transcription factor which is not incorporated in the first 
fusion, evaluating the effect of a putative modulating compound on the interaction 
between and the ligand by detecting binding of the ligand to a, in a parUcular 
host cell by measuring the production of reporter gene product in the host cell m 
ti.e presence or absence of the putative modulator, and identifying modulating 
compounds as those compounds altering production of the reported gene product 
in comparison to production of the reporter gene product in the absence of the 
modulating compound. Presently preferred for use in the assay are the lexA 
promoter, the lexA DNA binding domain, tiie GAIA transactivation domain, the 
lacZ reporter gene, and a yeast host cell. 

A modified version of the foregoing assay may be used in isolating 
^ucleotide encoding a protein tiiat binds to by transforming or 
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function. Fusion polypeptides are also provided, wherein amino acid 
sequences are expressed contiguously with amino acid sequences from other 
polypeptides. Such fusion polypeptides may possess modified biological, 
biochemical, and/or immunological piopCTties in comparison to wild-type a^. 
Analog polypeptides including additional amino acid (e.g., lysine or cysteine) 
residues that faciUtate multimer formation are contemplated. 

Also comprehended by tiie present invention are polypeptides and 
otfier non-pq,tide molecules which specifically bind to a^. Preferred binding 
molecules include antibodies (e.g., monoclonal and polyclonal antibodies), 
counterreceptors (e.g., membrane-associated and soluble forms) and oUier ligands 
(e.g., naturally occurring or syntiietic molecules), including tiiose which 
competitively bind in the presence of monoclonal antibodies and/or specific 
counterreceptors. Binding molecules are useful for purification of polypeptides 
and identifying cell types which express a^. Binding molecules are also useful for 
modulating (i.e., inhibiting, blocking or stimulating) of in vivo binding and/or 
signal transduction activities of a^. 

Assays to identify binding molecules are also provided, 
including immobilized ligand binding assays, solution binding assays, scintillation 
proximity assays, di-hybrid screening assays, and the like. 

In vitro assays for identifying antibodies or otiier compounds tiiat 
modulate the activity of may involve, for example, immobilizing or a 
natural ligand to which binds, detectably labelUng the nonimmobilized binding 
partner, incubating tiie binding partners together and determining tiie effect of a 
test compound on tiie amount of label bound wherein a reduction in the label 
bound in the presence of tiie test compound compared to tfie amount of label 
bound in tiie absence of tiie test compound indicates tiiat the test agent is an 
inhibitor of binding. 

Anotiier type of assay for identifying compounds that modulate the 
interaction between and a Hgand involves immobUizing or a fragment 
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As another aspect of the invention, monoclonal or polyclonal 
antibodies specific for may be employed in immunobistochemical analysis to 
localize to subcellular compartments or individual cells within tissues. 
Immunobistochemical analyses of this type are particularly useful when used in 
combination with in situ hybridization to localize both mRNA and polypeptide 

products of the gene- 

Identification of cell types which express may have significant 
ramifications for development of therapeutic and prophylactic agents. It is 
anticipated that the products of the invention related to can be employed in the 
treatment of diseases wherein macrophages are an essential element of the disease 
process. Animal models for many pathological conditions associated with 
macrophage activity have been described in the art. For example, in mice, 
macrophage recruitment to sites of both chronic and acute inflammation is 
reported by Jutila, et al, J.Leukocyte Biol. 54:30-39 (1993). In rats, Adams, et 
al , ITransplantation 53: 1115-11 19(1992) and Transplantation 56:794-799 (1993)] 
describe a model for graft arteriosclerosis following heterotropic abdominal 
cardiac allograft transplantation. Rosenfeld, etaL, [Arteriosclerosis 7:9-23(1987) 
and Arteriosclerosis 7:24-34 (1987)] describe induced atherosclerosis in rabbits 
fed a cholesterol supplemented diet. Hanenberg, et al , [Diabetologia 32: 126-134 
(1989)] report the spontaneous development of insulin-dependent diabetes in BB 
rats. Yamada et al., [Gastroenterolgy 104:159-111 (1993)] describe an induced 
inflammatory bowel disease, chronic granulomatous colitis, in rats following 
injections of streptococcal peptidoglycan-polysaccharide polymers. Cromartie, et 
al, [J.ExpMed. 745:1585-1602 (1977)] and Schwab, et al, [Infection and 
Immunity 5P:4436-4442 (1991)] report that injection of streptococcal cell wall 
protein into rats results in an arthritic condition characterized by inflammation of 
peripheral joints and subsequent joint destruction. Finally, Huitinga, et aL, 
[Eur.JJmmunol 23:709-715 (1993) describe experimental allergic 
encephalomyelitis, a model for multiple sclerosis, in Lewis rats. In each of these 



wo 95/17412 



PCT/US94/14832 



• 12 - 

models, antibodies, other binding proteins, or soluble forms of <x^ are 
utilized to attenuate the disease state, presumably through inactivation of 
macrophage activity. 

Pharmaceutical compositions for treatment of these and other 

5 disease states are provided by the invention. Pharmaceutical compositions are 

designed for the purpose of inhibiting interaction between and its ligand(s) and 
include various soluble and membrane-associated forms of (comprising the 
entire polypeptide, or fragments thereof which actively participate in cx^ 
binding), soluble and membrane-associated forms of binding proteins 

0 (including antibodies, ligands, and the like), intracellular or extracellular 

modulators of binding activity, and/or modulators of of^ and/or a^-ligand 
polypeptide expression, including modulators of transcription, translation, post- 
translational processing and/or intracellular transport. The invention also 
comprehends methods for treatment of disease states in which or^j binding is 

5 implicated, wherein a patient suffering from said disease state is provided an 

amount of a pharmaceutical composition of the invention sufficient to modulate 
levels of ofj, binding. The method of treatment of the invention is applicable to 
disease states such as, but not limited to. Type I diabetes, atherosclerosis, 
multiple sclerosis, asthma, psoriasis, and rheumatoid arthritis. 



Brief Description of the Drawing 

Numerous other aspects and advantages of the present invention 
will be apparent upon consideration of the following description thereof, reference 
being made to the drawing wherein: 

Figure 1 A through ID comprises an alignment of the human amino 
acid sequences of CDllb (SEQ ID NO: 3), CD 11c (SEQ ID NO: 4) and (SEQ 
ID NO: 2). 
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Detailed Description of the Invention 

The present invention is illustrated by the follovtdng examples 
relating to the isolation of a cDNA clone encoding from a human spleen 
cDNA library. More particularly, Example 1 illustrates the use of anti-canine 

5 axMi antibody in an attempt to detect a homologous human protein. Example 2 

details purification of canine a-pMi N-terminal sequencing of the polypeptide 
to design oligonucleotide primers for PGR amplification of the canine ot-j^i gene. 
Example 3 addresses large scale purification of canine ajMl internal 
sequencing in order to design additional PGR primers. Example 4 describes use 

10 of the PGR and internal sequence primers to amplify a fragment of the canine 

^TMl g^"^- Example 5 addresses cloning of the human aj-encoding cDNA 
sequence. Example 6 describes Northern blot hybridization analysis of human 
tissues and cells for expression of mRNA. Example 7 details the construction 
of human expression plasmids and transfection of COS cells with the resulting 

15 plasmids. Example 8 addresses ELISA analysis of expression in transfected 

GOS cells. Example 9 describes FAGS analysis of COS cells transfected with 
human of^ expression plasmids. Example 10 addresses immunoprecipitation of 
GDI 8 in association with in co-transfected GOS cells. Example 11 relates to 
stable transfection of expression constructs in Chinese hamster ovary cells. 

20 Example 12 addresses CD 18-dependent binding of to the intercellular adhesion 

molecule, ICAM-R. Example 13 describes scintillation proximity screening 
assays to identify inhibitors of orj ligand/anti-hgand binding interactions. 
ExEunple 14 addresses construction of expression plasmids which encode soluble 
forms of aj. Example 15 relates to production of a^j-specific monoclonal 

25 antibodies. Example 16 describes analysis of tissue distribution using 

polyclonal antiserum. Example 17 describes isolation of rat cDNA sequences 
which show homology to human gene sequences. Example 18 relates to 
construction of rat I domain expression plasmids, including I domain/IgG 
fusion proteins, and production of monoclonal antibodies to I domain fusion 
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proteins. Example 19 addresses isolation of mouse cDNA sequences which show 
homology to human gene sequences. Example 20 describes isolation of 
additional mouse cDNA clones used for conformational sequence analysis. 
Example 21 relates to in situ hybridization analysis of various mouse tissues to 
determine tissue and cell specific expression of the putative mouse homolog to 
human a^. Example 22 describes generation of expression constructs which 
encode the putative mouse homolog of human aj. Example 23 addresses design 
of a "knock-out" mouse wherein the gene encoding the putative mouse homolog 
of human or^ is disrupted. Example 24 describes isolation of rabbit cDNA clones 
which show homology to human encoding sequences. Example 25 describes 
animal models which resemble human disease states wherein modulation of is 
assayed for therapeutic capabilities. 

Example 1 

Attempt to Detect a Human Homolog of Canine f^Y Kfi 

The monoclonal antibody CalL8H2 [Moore, et al., supra] specific 
for canine ajMi tested for cross-reactivity on human peripheral blood 
leukocytes in an attempt to identify a human homolog of canine a'TMi- Cell 
preparations (typically 1 x 10^ cells) were incubated with undiluted hybridoma 
supernatant or a purified mouse IgG-negative control antibody (10 /xg/ml) on ice 
in the presence of 0. 1 % sodium azide. Monoclonal antibody binding was detected 
by subsequent incubation with FITC-conjugated horse anti-mouse IgG (Vector 
Laboratories, Burlingame, CA) at 6 ftg/ml. Stained cells were fixed with 2% w/v 
paraformaldehyde in phosphate buffered saline (PBS) and were analyzed with a 
Facstar Plus fluorescence-activated cell sorter (Becton Dickinson, Mountain View, 
CA). Typically, 10,000 cells were analyzed using logarithmic amplification for 
fluorescence intensity. 

The results indicated that Cal 1 .8H2 did not cross-react with surface 
proteins expressed on human peripheral blood leukocytes, while the control cells, 
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neoplastic canine peripheral blood lymphocytes, were essentially all positive for 

Because the monoclonal antibody Cal 1 . 8H2 specific for the canine 
a subunit did not cross react with a human homolog, isolation of canine ajj^j 
5 DNA was deemed a necessary prerequisite to isolate a counterpart human gene 

if one existed. 

Example 2 

Affinity Purification Of Canine ct -Yhn For N-Terminal Sequencing 

Canine of-j^i was affinity purified in order to determine N-terminal 

10 amino acid sequences for oligonucleotide probe/primer design. Briefly, anti-a-j^j 

monoclonal antibody Call.8H2 was coupled to Affigel 10 chromatographic resin 
(BioRad, Hercules, CA) and protein was isolated by specific antibody-protein 
interaction. Antibody was conjugated to the resin, according to the BioRad 
suggested protocol, at a concentration of approximately 5 mg antibody per ml of 

15 resin. Following the conjugation reaction, excess antibody was removed and the 
resin blocked with three volumes of 0. 1 M ethanolamine. The resin was then 
washed with thirty column volumes of phosphate buffered saline (PBS). 

Twenty-five grams of a single dog spleen were homogenized in 250 
ml of buffer containing 0,32 M sucrose in 25 mM Tris-HCl, Ph 8.0, with 

20 protease inhibitors. Nuclei and cellular debris were pelleted with centrifugation 
at 1000 g for 15 minutes. Membranes were pelleted from the supernatant with 
centrifugation at 100,000 g for 30 minutes. The membrane pellet was 
resuspended in 200 ml lysis buffer (50 mM NaCl, 50 mM borate, pH 8.0, with 
2% NP-40) and incubated for 1 hour on ice. Insoluble material was then pelleted 

25 by centrifugation at 1(X),0(X) g for 60 minutes. Ten milliliters of the cleared 
lysate were transferred to a 15 ml polypropylene tube with 0.5 ml Call.8H2- 
conjugated Affigel 10 resin described above. The tube was incubated overnight 
at 4 • C with rotation and the resin subsequently washed with 50 column volumes 
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D-PBS. The resin was then transferred to a microfuge tube and boiled for ten 
minutes in 1 ml Laemmli (non-reducing) sample buffer containing 0. 1 M Tris- 
HCl, pH 6.8, 2% SDS, 20% glycerol and 0.002% bromophenol blue. The resin 
was pelleted by centrifugation and discarded; the supernatant was treated with 
5 1/15 volume jS-mercaptoethanol (Sigma, St. Louis, MO) and run on a 7% 

polyacrylamide gel. The separated proteins were transferred to Immobilon PVDF 
membrane (Millipore, Bedford, MA) as follows. 

The gels were washed once in deionized, Millipore-filtered water 
and equilibrated for 15-45 minutes in 10 mM 3-[cyclohexylamino]-l- 

10 propanesulfonic acid (CAPS) transfer buffer, pH 10.5, with 10% methanol. 

Immobilon membranes were moistened with methanol, rinsed with filtered water, 
and equilibrated for 15-30 minutes in CAPS transfer buffer. The initial transfer 
was carried out using a Biorad transfer apparatus at 70 volts for 3 hours. The 
Immobilon membrane was removed after transfer and stained in filtered 0.1% 

15 R250 Coomassie stain for 10 minutes. Membranes were destained in 50% 

methanol/ 10% acetic acid three times, ten minutes each time. After destaining, 
the membranes were washed in filtered water and air-dried. 

Protein bands of approximately 150 kD, 95 kD, 50 kD and 30 kD 
were detected. Presumably the 50 kD and 30 kD bands resulted from antibody 

20 contamination. N-terminal sequencing was then attempted on both the 150 kD 
and 95 kD bands, but the 95 kD protein was blocked, preventing sequencing. 
The protein band of 150 kD was excised from the membrane and directly 
sequenced with an Applied Biosystems (Foster City, CA) Model 473 A protein 
sequencer according to the manufacturer's instructions. The resulting amino acid 

25 sequence is set in SEQ ID NO: 5 using single letter amino acid designations, 

FNLDVEEPMVFQ (SEQ ID NO: 5) 



BNSDOCID; <WO 9517412A1J_> 



95/17412 



PCT/US94/14832 



- 17 - 

The identified sequence included the FNLD sequence characteristic of a subunits 
of the integrin family [Tamura, et al, J.CelLBiGl. iJi: 1593-1604 (1990)]. 

Primer Design and Attempt to Amplify Canine Qf- i^^^ Sequences 

From the N-terminal sequence information, three oligonucleotide 
probes were designed for hybridization: a) "Tommer," a fiiUy degenerate 
oligonucleotide; b) "Patmer," a partially degenerate oligonucleotide; and c) 
"Guessmer," a nondegenerate oligonucleotide based on mammalian codon usage. 
These probes are set out below as SEQ ID NOS: 6, 7 and 8, respectively. 
Nucleic acid symbols are in accordance with 37 C.F.R, §1.882 for these and all 
other nucleotide sequences herein. 

5 '-TTYAAYYTGGAYGTNGARGARCCNATGGTlSnTYCA-3(SEQ ID NO: 6) 
5 -TTCAACCTGGACGTGGAGGAGCCCATGGTGTTCCAA<SEQ ID NO: 7) 
5'-TrCAACCTGGACGTNGAASANCCCATGGTCTTCCAA-@BEQ ID NO: 8) 

Based on sequencing data, no relevant clones were detected using 
these oligonucleotides in several low stringency hybridizations to a canine 
spleen/peripheral blood macrophage cDNA library cloned into XZAP (Stratagene, 
La Jolla, CA). 

Four other oligonucleotide primers, designated 5 'Deg, 5 'Spec, 
3'Deg and 3 'Spec (as set out in SEQ ID NOS: 9, 10, 11 and 12, respectively, 
wherein Deg indicates degenerate and Spec indicates non-degenerate) were 
subsequently designed based on the deduced N-terminal sequence for attempts to 
amplify canine a^Mi sequences by PGR from phage library DNA purified from 
plate lysates of the Stratagene library described above. 



5 '-TTYAA YYTNGAYGTNGARGARCC-3 ' 
5 '-TTYAAYYTGGACGTNGAAGA-3 ' 



(SEQ ID NO: 9) 
(SEQ ID NO: 10) 
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5 '-TGRAANACCATNGGYTC-3 ' (SEQ ID NO: 11) 

5 '-TTGGAAGACCATNGGYTC-3 ' (SEQ ID NO: 12) 

The ajf^i oligonucleotide primers were paired with T3 or T7 
vector primers, as set out in SEQ ID NOS: 13 and 14, respectively, which 
5 hybridize to sequences flanking the polylinker region in the Bluescript phagemid 

found in XZAP. 

5 '-ATTAACCCTCACTAAAG-S ' (SEQ ID NO: 13) 

5 '-AATACGACTCACTATAG-3 ' (SEQ ID NO: 14) 



The PGR amplification was carried out in Tag buffer (Boehringer Mannheim, 

10 Indianapolis, IN) containing magnesium with 150 ng of library DNA, 1 fig of 

each primer, 200 /xM dNTPs and 2.5 units Tag polymerase (Boehringer 
Mannheim) and the products were separated by electrophoresis on a 1 % agarose 
gel in Tris-Acetate-EDTA (TAE) buffer with 0.25 /xg/ml ethidium bromide. DNA 
was transferred to a Hybond (Amersham, Arlington Heights, IL) membrane by 

15 wicking overnight in lOX SSPE. After transfer, the immobilized DNA was 

denatured with 0.5 M NaOH with 0.6 M NaCl, neutralized with 1.0 M Tris-HCl, 
pH 8.0, in 1.5 M NaCl, and washed with 2X SSPE before UV crosslinking with 
a Stratalinker (Stratagene) crosslinking apparatus. The membrane was incubated 
inprehybridization buffer (5X SSPE, 4X Denhardts, 0,8% SDS, 30% formamide) 

20 for 2 hr at 50'C with agitation. 

Oligonucleotide probes 5'Deg, 5 'Spec, 3'Deg and 3 'Spec (SEQ 
ID NOS: 9, 10, 11 and 12, respectively) were labeled using a Boehringer 
Mannheim kinase buffer with 100-300 fiCi yP^^-dATP and 1-3 units of 
polynucleotide kinase for 1-3 hr at 37* C. Unincorporated label was removed with 

25 Sephadex G-25 fine (Pharmacia, Piscataway, NJ) chromatography using 10 mM 

Tris-HCl, pH 8.0, 1 mM EDTA (TE) buffer and the flow-through added directly 
to the prehybridization solution. Membranes were probed for 16 hr at 42*C with 
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agitation and washed repeatedly, with a final stringency wash of IX SSPE/0. 1 % 
SDS at 50* for 15 min. The blot was then exposed to Kodak X-Omat AR film 
for 1-4 hours at -SOX. 

The oligonucleotides 5'Deg, 5 'Spec, 3'Deg and 3 'Spec only 
5 hybridized to PGR products from the reactions in which they were used as 

primers and failed to hybridize as expected to PGR products from the reactions 
in which they were not used as primers. Thus, it was concluded that none of the 
PGR products were specific for a-jMi because no product hybridized with all of 
the appropriate probes. 

10 Example 3 

Large Scale Affinity Purification Of Ganine of TMi For Internal Sequencing 

In order to provide additional amino acid sequence for primer 
design, canine ofrMl purified for internal sequencing. Three sections of 
frozen spleen (approximately 50 g each) and frozen cells from two partial spleens 

15 from adult dogs were used to generate protein for internal sequencing. Fifty 

grams of spleen were homogenized in 200-300 ml borate buffer with a Waring 
blender. The homogenized material was diluted with 1 volume of buffer 
containing 4% NP-40, and the mixture then gently agitated for at least one hour. 
The resulting lysate was cleared of large debris by centrifugation at 2000 g for 20 

20 min, and then filtered through either a Goming (Coming, NY) prefilter or a 

Goming 0.8 micron filter. The lysate was further clarified by filtration through 
the Goming 0.4 micron filter system. 

Splenic lysate and the antibody-conjugated Affigel 10 resin 
described in Example 2 were combined at a 150: 1 volume ratio in 100 ml aliquots 

25 and incubated overnight at 4*G with rocking. The lysate was removed after 
centrifugation at 1000 g for 5 minutes, combined with more antibody-conjugated 
Affigel 10 resin and incubated overnight as above. The absorbed resin aliquots 
were then combined and washed with 50 volumes D-PBS/0. 1 % Tween 20 and the 
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resin transferred to a 50 ml Biorad column. Adsorbed protein was eluted from 
the resin with 3-5 volumes of 0. 1 M glycine (pH 2.5); fractions of approximately 
900 ^1 were collected and neutralized with 100 /il 1 M Tris buffer, pH 8,0. 
Aliquots of 15 ^1 were removed from each fraction and boiled in an equal volume 
5 of 2X Laemmli sample buffer with 1/15 volume 1 M dithiothreitol (DTT). These 

samples were electrophoresed on 8 % Novex (San Diego, CA) polyacrylamide gels 
and visualized either by Coomassie stain or by silver stain using a Daiichi kit 
(Enprotech, Natick, MA) according to the manufacturer's suggested protocol. 
-Fractions which contained the largest amounts of protein were combined and 
10 concentrated by vacuum. The remaining solution was diluted by 50% with 

reducing Laemmli sample buffer and run on 1.5 mm 7% polyacrylamide gels in 
Tris-glycine/SDS buffer. Protein was transferred from the gels to Immobilon 
membrane by the procedure described in Example 2 using the Hoefer transfer 
apparatus. 

15 The protein bands corresponding to canine c^tmi were excised from 

10 PVDF membranes and resulted in approximately 47 fig total protein. The 
bands were destained in 4 ml 50% methanol for 5 minutes, air dried and cut into 
1x2 mm pieces. The membrane pieces were submerged in 2 ml 95% acetone 
at 4'C for 30 minutes with occasional vortexing and then air dried. 

20 Prior to proteolytic cleavage of the membrane bound protein, 3 mg 

of cyanogen bromide (CNBr) (Pierce, Rockford, XL) were dissolved in 1.25 ml 
70% formic acid. This solution was then added to a tube containing the PVDF 
membrane pieces and the tube incubated in the dark at room temperature for 24 
hours. The supernatant (SI) was then removed to another tube and the membrane 

25 pieces washed with 0.25 ml 70% formic acid. This supernatant (S2) was removed 

and added to the previous supernatant (SI). Two milliliters of Milli Q water were 
added to the combined supematants (SI and S2) and the solution lyophilized. The 
PVDF membrane pieces were dried under nitrogen and extracted again with 1.25 
ml 60% acetonitrile, 0.1% tetrafluoroacetic acid (TFA) at 42 'C for 17 hours. 
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This supernatant (S3) was removed and the membrane pieces extracted again with 
1,0 ml 80% acetonitrile with 0,08% TFA at 42 for 1 hour. This supernatant 
(S4) was combined with the previous supematants (SI, S2 and S3) and vacuum 
dried. 

5 The dried CNBr fragments were then dissolved in 63 ^ISM urea, 

0.4 M NH4HCO3. The fragments were reduced in 5 ^1 45 mM dithiothreitol 
(DTT) and subsequently incubated at 50* C for 15 minutes. The solution was then 
cooled to room temperature and the fragments alkylated by adding 5 /xl 100 mM 
iodoacetamide (Sigma, St. Louis, MO). Following a 15 minute incubation at 

10 room temperature, the sample was diluted with 187 ^1 Milli Q water to a final 
urea concentration of 2.0 M. Trypsin (Worthington, Freehold, NJ) was then 
added at a ratio of 1:25 (w:w) of enzyme to protein and the protein digested for 
24 hours at 37*C. Digestion was terminated with addition of 30 fil TFA. 

The protein fragments were then separated with high performance 

15 liquid chromatography (HPLC) on a Waters 625 LC system (Millipore, Milford, 
MA) using a 2.1 X 250 mm, 5 micron Vydac C-18 column (Vydac, Hesperia, 
CA) equilibrated in 0,05% TFA and HPLC water (buffer A). The peptides were 
eluted with increasing concentration of 80% acetonitrile in 0.04% TFA (buffer B) 
with a gradient of 38-75% buffer B for 65-95 minutes and 75-98% buffer B for 

20 95-105 minutes. Peptides were fractionated at a flow rate of 0.2 ml/minute and 
detected at 210 nm. 

Following fractionation, the amino acid sequence of the peptides 
was analyzed by automated Edman degradation performed on an Applied 
Biosystems Model 43 7A protein sequencer using the manufacturer's standard 

25 cycles and the Model 610A Data Analysis software program, Version 1.2. 1. All 

sequencing reagents were supplied by Applied Biosystems. The amino acid 
sequences of seven of the eight internal fragments are set out below wherein "X" 
indicates the identity of the amino acid was not certain. 
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VFQEXGAGFGQ (SEQ ID NO: 15) 

LYDXVAATGLXQPI (SEQ ID NO: 16) 

PLEYXDVIPQAE (SEQ ID NO: 17) 

FQEGFSXVLX (SEQ ID NO: 18) 

TSPTFIXMSQENVD (SEQ ID NO: 19) 

LWGAPLEVVAVXQTGR (SEQ ID NO: 20) 

LDXKPXDTA (SEQ ID NO: 21) 



Primer Design 

One internal amino acid sequence (set out in SEQ ID NO: 22) 
10 obtained was then used to design a fully degenerate oligonucleotide primer, 

designated p4(R) as set out in SEQ ID NO: 23. 

FGEQFSE (SEQ ID NO: 22) 

5 '-RAANCCYTCYTGRAAACTYTC-3 ' (SEQ ID NO: 23) 



Example 4 

15 PGR Cloning Of A Canine o^Y hfi Fragment 

The 5' portion of the canine ot-iMi 8^^^ amplified from 
double-stranded canine splenic cDNA by PGR. 



A. Generation of Double Stranded Canine Spleen cDNA 
One gram of frozen material from a juvenile dog spleen was ground 
20 in liquid nitrogen on dry ice and homogenized in 20 ml RNA-Stat 60 buffer (Tel- 

Test B, Inc, Friendswood, TX). Four ml chloroform were added, and the 
solution extracted by centrifugation at 12,000 g for 15 minutes. RNA was 
precipitated from the aqueous layer with 10 ml ethanol. Poly A"*" RNA was then 
selected on Dynal Oligo dT Dynabeads (Dynal, Oslo, Norway). Five aliquots of 
25 100 ftg total RNA were combined and diluted with an equal volume of 2X binding 
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buffer (20 mM Tris-HCl, pH 7.5, 1.0 M LiCl, 1 mM EDTA, 0.1% SDS). RNA 
was then incubated 5 minutes with the Oligo dT Dynabeads (1 .0 ml or 5 mg 
beads for all the samples). Beads were washed with buffer containing 10 mM 
Tris-HCl, pH 7,5, 0.15 M LiCl, 1 mM EDTA and 0.1% SDS, according to the 
5 manufacturer's suggested protocol prior to elution of poly A"*" mRNA with 2 mM 

EDTA, pH 7.5. Double-stranded cDNA was then generated using the eluted poly 
A"^ mRNA and the Boehringer Mannheim cDNA Synthesis Kit according to the 
manufacturer's suggested protocol. 

B, Isolation of a Partial Canine a^j^j cDNA 

10 Oligonucleotide primers 5 'Deg (SEQ ID NO: 9) and p4(R) (SEQ 

ID NO: 23) were employed in a standard PGR reaction using 150 ng double- 
stranded cDNA, 500 ng of each primer, 200 fiM dNTPs and 1.5 units Tag 
polymerase (Boehringer Mannheim) in Tag buffer (Boehringer Mannheim) with 
magnesium. The resulting products (1 /xl of the original reaction) were subjected 

15 to a second round of PGR with the same primers to increase product yield. This 

band was eluted from a 1 % agarose gel onto Schleicher & Schuell (Keene, NH) 
NA45 paper in a buffer containing 10 mM Tris-HCl, pH 8, 1 mM EDTA, 1.5 M 
NaCl at 65 "C, precipitated, and ligated into the pCR'™II vector (Invitrogen, San 
Diego, CA) using the TA cloning kit (Invitrogen) and the manufacturer's 

20 suggested protocol. The ligation mixture was transformed by electroporation into 

XL-1 Blue bacteria (Stratagene). One clone, 2.7, was determined to contain 
sequences corresponding to of-n^l Peptide sequences which were not utilized in 
design of the primers. 

Sequencing was performed with an Applied Biosystems 373 A DNA 

25 sequencer (Foster City, CA) with a Dye-deoxy terminator cycle sequence kit 
(ABI) in which fluorescent-labeled dNTPs were incorporated in an asymmetric 
PCR reaction [McCabe, "Production of Single Stranded DNA by Asymmetric 
PCR," in PCR Protocols: A Guide to Methods and Applications . Innis, ec aL 
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(eds.) pp. 76-83 Academic Press: New York (1990)] as follows. Samples were 
held at 96''C for 4 minutes and subjected to 25 cycles of the step sequence: 96**C, 
for 15 seconds; 50**C for 1 second; 60^*0 for 4 minutes. Sequence data was 
automatically down-loaded into sample files on the computer that included 
5 chromatogram and text files. The sequence of the entire insert of clone 2.7 is set 

out in SEQ ID NO: 24. 

Attempts to isolate the full length canine c^xMl cDNA from the 
Stratagene library (as described in Example 2) were unsuccessful. Approximately 
1 X 10^ phage plaques were screened by hybridization under low stringency 

10 conditions using 30% formamide with clone 2.7 as a probe, but no positive clones 

resulted. Attempts to amplify relevant sequences downstream from those 
represented in clone 2.7 using specific oligonucleotides derived from clone 2.7 or 
degenerate primers based on amino acid sequence from other peptide fragments 
paired with a degenerate oligonucleotide based on the conserved or subunit amino 

15 acid motif GFFKR [Tamura, et aL, supra] were also unsuccessful. 

Example 5 

Cloning Of A Putative Human Homolog Of Canine c^x Ml 

To attempt the isolation of a human sequence homologous to canine 
^TMi approximately 1 kb canine ajMi fragment from clone 2.7 was used as 
20 a probe. The probe was generated by PGR under conditions described in 

Example 2 using NT2 (as set out in SEQ ID NO: 25) and p4(R) (SEQ ID NO: 
23) primers. 

5 '-GTNTTYCARGARGAYGG-3 ' (SEQ ID NO: 25) 

The PGR product was purified using the Qiagen (Ghatsworth, GA) Quick Spin kit 
25 and the manufacturer's suggested protocol. The purified DNA (200 ng) was 
labeled with 200 /xCi of^^PdCTP using the Boehringer Mannheim Random Prime 
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Labelling kit and the manufacturer's suggested protocoL Unincorporated isotope 
was removed with Sephadex G25 (fine) gravity chromatography. The probe was 
denatured with 0.2 N NaOH and neutralized with 0.4 M Tris-HCl, pH 8,0, before 
use. 

5 Colony lifts on Hybond filters (Amersham) of a human spleen 

cDNA library in pCDNA/Amp (Invitrogen, San Diego, CA) were prepared. The 
filters were initially denatured and neutralized as described in Example 2 and 
subsequently incubated in a prehybridization solution (8 ml/filter) with 30% 
formamide at 50'C with gentle agitation for 2 hours. Labeled probe as described 

10 above was added to this solution and incubated with the filters for 14 hours at. 

42 *C, The filters were washed twice in 2X SSC/0.1% SDS at 37*C and twice 
in 2X SSC/0. 1 % SDS at 50 X. Final stringency washes were IX SSC/0. 1 % SDS , 
twice at 65" C (IX SSC is 150 mM NaCl, 15 mM sodium citrate, pH 7.0). 
Filters were exposed to Kodak X-Omat AR film for six hours with an intensifying 

15 screen. Colonies giving signals on duplicate lifts were streaked on LB medium 

with magnesium (LBM)/carbenicillin plates and incubated overnight at 37 *C. 
Resulting streaked colonies were lifted with Hybond filters and these filters were 
treated as above. The filters were hybridized under more stringent conditions 
with the 1 kb probe from clone 2.7, labeled as previously described, in a 50% 

20 formamide hybridization solution at 50 *C for 3 hours. Probed filters were 

washed with a final stringency of 0.1 X SSC/0. 1% SDS at 65'C and exposed to 
Kodak X-Omat AR film for 2.5 hours at -80 "C with an intensifying screen. 
Positive colonies were identified and cultured in LBM/carbenicillin medium 
overnight. DNA from the cultures was prepared using the Promega Wizard 

25 miniprep kit according to the manufacturer's suggested protocol and the resulting 

DNA was sequenced. 

The initial screening resulted in 18 positive clones, while the 
secondary screening under more stringent hybridization conditions produced one 
positive clone which was designated 19 A2. The DNA and deduced amino acid 
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sequences of the human clone 19A2 are set out in SEQ ID NOS: 1 and 2, 
respectively. 

Characteristics Of The Human cDNA and Predicted Polypeptide 

Clone 19 A2 encompasses the entire coding region for the mature 
5 protein, plus 48 bases (16 amino acid residues) of the 5' upstream signal 
sequence and 241 bases of 3 ' untranslated sequence which do not terminate in a 
polyadenylation sequence. The core molecular weight of the mature protein is 
predicted to be around 125 kD, The extracellular domain is predicted to 
encompass approximately amino acid residues 17 through 1108 of SEQ ID NO: 

10 2- This extracellular region is contiguous with about a 20 amino acid region 

homologous to the human CD 11c transmembrane region (residues 1109 through 
1128 of SEQ ID NO: 2). The cytoplasmic domain comprises approximately 30 
amino acids (about residues 1129 through 1161 of SEQ ID NO: 2). The protein 
also contains a region (around residues 150 through 352) of approximately 202 

15 amino acids homologous to the I (insertion) domain common to CDlla, CDllb 

and CDllc [Larson and Springer, supra], [Shaw, et aL, J.BioLChem. 
26P:6016-6025 (1994)] and in VLA-1 and VLA-2, [Tamura, etal, supra]. The 
I domain in other integrins has been shown to participate in ICAM binding 
[Landis, et al , J. CelLBiol 120\ 1519-1527 (1993); Diamond, et aL , 7, CelLBioL 

20 720:1031-1043 (1993)], suggesting that of^ may also bind members of the ICAM 

family of surface molecules. This region has not been demonstrated to exist in 
any other integrin subunits. 

The deduced amino acid sequence of shows approximately 36% 
identity to that of CDlla, approximately 60% identity to CDllb and 

25 approximately 66% identity to CDl Ic. An alignment of amino acid sequences for 

(CDllb SEQ ID NO: 3), CDllc (SEQ ID NO: 4) and (SEQ ID NO: 2) is 
presented in Figure 1. 
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The cytoplasmic domains of ot subunits in &2 integrins are typically 
distinct from one another within the same species, while individual a subunits 
show high degrees of homology across species boundaries. Consistent with these 
observations, the cytoplasmic region of differs markedly from CDlla, CDllb, 
5 and CDl Ic except for a membrane proximal GFFKR amino acid sequence which 
has been shown to be conserved among all a integrins [Rojiani, et aL, 
Biochemistry 30: 9859-9866 (1991)]. Since the cytoplasmic tail region of integrins 
has been implicated in "inside out" signaling and in avidity regulation [Landis et 
aL, supra], it is possible that interacts with cytosolic molecules distinct from 

10 those interacting with CDlla, CDllb, and CDllc, and, as a result, participates 
in signaling pathways distinct from those involving other ^2 integrins. 

The extracellular domain of contains a conserved DGSGS amino 
acid sequence adjacent the I-domain; in CDllb, the DGSGS sequence is a 
metal-binding region required for ligand interaction [Michishita, et aL Cell 

15 72:857-867 (1993)]. Three additional putative cation binding sites in CDllb and 

CDllc are conserved in the sequence at amino acids 465-474, 518-527, and 
592-600 in clone 19A2 (SEQ ID NO: 1). The I-domain is 36%, 62%, and 
57% identical to the corresponding regions in CDlla, CDllb, and CDllc, 
respectively, and the relatively low sequence homology in this region suggests that 

20 may interact with a set of extracellular proteins distinct from proteins with 

which other known 02 integrins interact. Alternatively, the affinity of for 
known ^2 integrin ligands, for example, ICAM-1, ICAM-2 and/or ICAM-R, may 
be distinct from that demonstrated for the other ^2 integrin/ICAM interactions. 
[See Example 12.] 



4 
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Example 6 

Northern Analysis of Human Expression in Tissues 

In order to determine the relative level of expression and tissue 
specificity of a^, Northern analysis was performed using fragments from clone 
5 19A2 as probes. Approximately 10 ftg of total KNA from each of several human 

tissues or cultured cell lines were loaded on a formaldehyde agarose gel in the 
presence of 1 /xg of ethidium bromide. After electrophoresis at 100 V for 4 hr, 
the RNA was transferred to a nitrocellulose membrane (Schleicher & Schuell) by 
wicking in lOX SSC overnight. The membrane was baked 1.5 hr at 80'C under 

10 vacuum. Prehybridization solution containing 50% formamide in 3-(N- 

morpholino)propane sulfonic acid (MOPS) buffer was used to block the membrane 
for 3 hr at 42 X. Fragments of clone 19A2 were labeled with the Boehringer 
Mannheim Random Prime kit according to the manufacturer's instructions 
including both aP^^dCTP and aP^^dTTP. Unincorporated label was removed on 

15 a Sephadex G25 column in TE buffer. The membrane was probed with 1.5 x 10^ 

counts per ml of prehybridization buffer. The blot was then washed successively 
with 2X SSC/0.1% SDS at room temperature, 2X SSC/0.1% SDS at 42 'C, 2X 
SSC/0.1% SDS at 50 "C, IX SSC/0.1% SDS at 50 0.5X SSC/0.1% SDS at 
50* C and 0. IX SSC/0. 1 % SDS at 50*C. The blot was then exposed to film for 

20 19 hr. 

Hybridization using a BstXl fragment from clone 19A2 
(corresponding to nucleotides 2011 to 3388 in SEQ ID NO: 1) revealed a weak 
signal in the approximately 5 kb range in liver, placenta, thymus, and tonsil total 
RNA. No signal was detected in kidney, brain or heart samples. The amount of 
25 RNA present in the kidney lane was minimal, as determined with ethidium 

bromide staining. 

When using a second fragment of clone 19A2 (encompsissing the 
region from bases 500 to 2100 in SEQ ID NO: 1), RNA transcripts of two 
different sizes were detected in a human multi-tissue Northern (MTN) blot using 
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polyA"*" RNA (Clontech). An approximately 6.5 kb band was observed in spleen 
and skeletal muscle, while a 4.5 kb band was detected in lung and peripheral 
blood leukocytes. The variation in sizes observed could be caused by tissue 
specific polyadenylation, cross reactivity of the probe with other integrin family 
5 members, or hybridization with alternatively spliced mRNAs. 

Northern analysis using a third fragment from clone 19A2, 
spanning nucleotides 2000 to 3100 in SEQ ID NO: 1, gave results consistent with 
those using the other clone 19A2 fragments. 

RNA from three myeloid lineage cell lines was also probed using 

10 the fragments corresponding to nucleotides 500 to 2100 and 2000 to 3100 in SEQ 
ID NO:l. A THP-1 cell line, previously stimulated with PMA, gave a diffuse 
signal in the same size range (approximately 5.0 kb), with a slightly stronger 
intensity than the tissue signals. RNA from unstimulated and DMSO-stimulated 
HL-60 cells hybridized with the probe at the same intensity as the tissue 

15 samples, however, PMA treatment seemed to increase the signal intensity. Since 

PMA and DMSO drive HL-60 cell differentiation toward monocyte/macrophage 
and granulocyte pathways, respectively, this result suggests enhanced 
expression in monocyte/macrophage cell types. U937 cells expressed the 
message and this signal did not increase with PMA stimulation. No band was 

20 detected in Molt, Daudi, H9, JY, or Jurkat cells. 

Example 7 

Transient Expression of Human Constructs 

A. Generation of expression constructs 

The human clone 19A2 lacks an initiating methionine codon and 
25 possibly some of the 5 ' signal sequence. Therefore, in order to generate a human 
expression plasmid containing 19A2 sequences, two different strategies were used. 
In the first, two plasmids were constructed in which signal peptide sequences 
derived from genes encoding either GDI lb or CDllc were spliced into clone 
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19A2 to generate a chimeric sequence. In the second approach, a third 
plasmid was constructed in which an adenosine base was added at position 0 in 
clone 19A2 to encode an initiating methionine. 

The three plasmids contained different regions which encoded the 
5 5 ' portion of the sequence or the chimeric sequence. The o; ^ region was 

PGR amplified (see conditions in Example 2) with a specific 3 ' primer BamRev 
(set out below in SEQ ID NO: 25) and one of three 5 ' primers. The three 5 ' 
primers contained in sequence: (1) identical nonspecific bases at ]>ositions 1-6 
allowing for digestion, an £coRI site from positions 7-12 and a consensus Kozak 

10 sequence from positions 13-18; (2) a portion of the CDllb (primer ERIB) or 

GDI Ic (primer ERIG) signal sequence, or an adenosine (primer ERID); and (3) 
an additional 15-17 bases specifically overlapping 5 ' sequences from clone 19A2 
to allow primer annealing. Primers ERIE, ERIG or ERID are set out in SEQ 
ID NOS: 27, 28 or 29, respectively, where the initiating methionine codon is 

15 underlined and the EcdRI site is double underlined. 

5 -GGAGTGTCAGGATGCGGGTG-3 ' (SEQ ID NO: 26) 

5 -AGTTAC GAATrC GGCAGCATGGCTGTAGGGGTGCTItSEgXin3^g3: 27) 

20 5 -AGTTAC GAATTG GGCACCATGACTCGGACTGTGCTTtSEOIIXMa: 28) 

5 -AGTTAC GAATTG GCCACCATGACGTrCGGCACTGTQaEO ID NO: 29) 

The resulting PGR product was digested with EcoKL and BamHh 

All three plasmids contained a common second region (to be 
25 inserted immediately downstream from the 5 ' region described in the previous 

paragraph) including the 3 ' end of the clone. The second region, which 
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extended from nucleotide 625 into the Xbal site in the vector 3 ' polylinker region 
of clone 19A2, was isolated by digestion of clone 19A2 with BaniHl and Xbal. 

Three ligation reactions were prepared in which the 3' 
BamSLIXbal fragment was ligated to one of the three 5' EcdSllBamHl 

5 fragments using Boehringer Mannheim ligase buffer and T4 ligase (1 unit per 

reaction). After a 4 hour incubation at 14**C, an appropriate amount of vector 
pcDNA.3 (Invitrogen) digested with EcdSl and Xbal was added to each reaction 
with an additional unit of ligase. Reactions were allowed to continue for another 
14 hours. One tenth of the reaction mixture was then transformed into competent 

10 XL-1 Blue cells. The resulting colonies were cultured and the DNA isolated as 
in Example 5. Digestion with EcdSl identified three clones which were positive 
for that restriction site, and thus, the engineered signal sequences. The clones 
were designated pATM.Bl (CDllb/aa, from primer ERIB), pATM.ClO 
(CDUc/a^j, from primer ERIC) and pATM.D12 (adenosine/or^ from primer 

15 ERld). The presence of the appropriate signal sequences in each clone was 

verified by nucleic acid sequencing. 

B. Transfection of COS Cells 

Expression from the plasmids discussed above was effected by 
cotransfection of COS cells with the individual plasmids and a CD 18 expression 

20 plasmid, pRC.CDlS. As a positive control, COS cells were also co-transfected 
with the plasmid pRC.CDlS and a CDlla expression plasmid, pDC.CDllA. 

Cells were passaged in culture medium (DMEM/10%FBS/pen- 
strep) into 10 cm Coming tissue culture-treated petri dishes at 50% confluency 16 
hours prior to transfection. Cells were removed from the plates with Versene 

25 buffer (0.5 mM NaEDTA in PBS) without trypsin for all procedures. Before 
transfection, the plates were washed once with serum-free DMEM. Fifteen 
micrograms of each plasmid were added to 5 ml transfection buffer (DMEM with 
20 yiglml DEAE-Dextran and 0.5 mM chloroquine) on each plate. After 1.5 
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hours incubation at SVC, the cells were shocked for 1 minute with 5 ml 
DMEM/10% DMSO. This DMSO solution was then replaced with 10 ml/plate 
culture medium. 

Resulting transfectants were analyzed by ELISA, FACS, and 
5 immunoprecipitation as described in Examples 8, 9, and 10. 

Example 8 

ELISA Analysis of COS Transfectants 

In order to determine if the COS cells co-transfected with CD 18 
expression plasmid pRC.CDlS and an plasmid expressed on the cell surface 

10 in association with CD 18, ELISAs were performed using primary antibodies 

raised against CD18 (e.g., TSl/18 purified from ATCC HB203). As a positive 
control, ELISAs were also performed on cells co-transfected with the CD 18 
expression plasmid and a CDlla expression plasmid, pDC.CDllA. The primary 
antibodies in this control included CD18 antibodies and anti-CDlla antibodies 

15 (e.^., TSl/22 purified from ATCC HB202). 

For ELISA, cells from each plate were removed with Versene 
buffer and transferred to a single 96-well flat-bottomed Coming tissue culture 
plate. Cells were allowed to incubate in culture media 2 days prior to assay. The 
plates were then washed twice with 150 /nl/well D-PBS/0.5 % teleost skin gelatin 

20 (Sigma) solution. This buffer was used in all steps except during the 

development. All washes and incubations were performed at room temperature. 
The wells were blocked with gelatin solution for 1 hour. Primary antibodies were 
diluted to 10 ^g/ml in gelatin solution and 50 fil were then added to each well. 
Triplicate wells were set up for each primary antibody. After 1 hour incubation, 

25 plates were washed 3X with 150 /xl/well gelatin solution. Secondary antibody 

(goat anti-mouse Ig/HRP-Fc specific [Jackson, West Grove, PA]) at a 1:3500 
dilution was added at 50 ^1/well and plates were incubated for 1 hour. After 
three washes, plates were developed for 20 minutes with 100 /il/well o- 
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phenyldiamine (OPD) (Sigma) solution (1 mg/ml OPD in citrate buffer) before 
addition of 50 /il/well 15% sulfuric acid. 

Analysis of transfectants in the ELISA format with anti-CD18 
specific antibodies revealed no significant expression above background in cells 
5 transfected only with the plasmid encoding CD18. Cells co-transfected with 

plasmid containing GDI la and CD 18 showed an increase in expression over 
background when analyzed with CD18 specific antibodies or with reagents 
specific for CD 11a. Further analysis of cells co-transfected with plasmids 
encoding CD 18 and one of the expression constructs (pATM.ClO or 
10 pATM.D12) revealed that cell surface expression of CD18 was rescued by 

concomitant expression of a^. The increase in detectable CD18 expression in 
COS cells transfected with pATM.ClO or pATM.D12 was comparable to that 
observed in co-transfected CDlla/CD18 positive control cells. 

Example 9 

15 FAGS Analysis of COS Transfectants 

For FAGS analysis, cells in petri dishes were fed with fresh culture 
medium the day after transfection and allowed to incubate 2 days prior to the 
assay. Transfectant cells were removed from the plates with 3 ml Versene, 
washed once with 5 ml FAGS buffer (DMEM/2% FBS/0,2% sodium azide) and 

20 diluted to 500,000 cells/sample in 0. 1 ml FACS buffer. Ten microliters of either 

1 mg/ml FITG-conjugated GD18, GDI la, or GDI lb specific antibodies (Becton 
Dickinson) or 800 fxg/nd CFSE-conjugated murine 23F2G (anti-CD 18) (ATCG 
HB11081) were added to each sample. Samples were then incubated on ice for 
45 minutes, washed 3X with 5 ml/wash FACS buffer and resuspended in 0,2 ml 

25 FAGS buffer. Samples were processed on a Becton Dickinson FAGscan and the 

data analyzed using Lysys II software (Becton Dickinson), 

COS cells transfected with CD 18 sequences only did not stain for 
GD18, GDlla or GDllb. When co-transfected with CDlla/GD18, about 15% 
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of the cells stained with antibodies to CDlla or CD18. All cells transfected with 
CD 18 and any of^j construct resulted in no detectable staining for CD 11a and 
CDUb. The pATM.Bl, pATM.ClO and pATM.D12 groups stained 4%, 13% 
and 8% positive for CD18, respectively. Fluorescence of the positive population 
5 in the CDlla/CD18 group was 4-fold higher than background. In comparison, 

the co-transfection of constructs with the CD 18 construct produced a positive 
population that showed a 4- to 7-fold increase in fluorescence intensity over 
background. 

Example 10 

10 Biotin-Labeled Immunoprecipitation of 

Human ofj ^CDlS Complexes from Co-transfected COS Cells 

Immunoprecipitation was attempted on cells co-transfected with 
CD 18 and each of the expression plasmids separately described in Example 7 
in order to determine if of^ could be isolated as part of the a0 heterodimer 

15 complex characteristic of integrins. 

Transfected cells (1-3 x 10^ cells/group) were removed from petri 
dishes with Versene buffer and washed 3 times in 50 ml/group D-PBS. Each 
sample was labeled with 2 mg Sulpho-NHS Biotin (Pierce, Rockford, IL) for 15 
minutes at room temperature. The reaction was quenched by washing 3 times in 

20 50 ml/sample cold D-PBS, Washed cells were resuspended in 1 ml lysis buffer 

(1 % NP40, 50 mM Tris-HCl, pH 8,0, 0.2 M NaCl, 2 mM Ca"^ + , 2 mM Mg+ + , 
and protease inhibitors) and incubated 15 minutes on ice. Insoluble material was 
pelleted by centrifugation at 10,000 g for 5 minutes, and the supernatant removed 
to fresh tubes. In order to remove material non-specifically reactive with mouse 

25 immunoglobulin, a pre-clearance step was initially performed. Twenty-five 

micrograms of mouse immunoglobulin (Cappel, West Chester, PA) was incubated 
with supematants at 4-C. After 2.5 hr, 100 fil (25 fig) rabbit anti-mouse Ig 
conjugated Sepharose (prepared from Protein A Sepharose 4B and rabbit anti- 
mouse IgG, both from Zymed, San Francisco, CA) was added to each sample; 
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incubation was continued at 4-C with rocking for 16 hours. Sepharose beads 
were removed from the supematants by centrifugation. After pre-clearance, the 
supematants were then treated with 20 fig anti-CD18 antibody (TS1.18) for 2 
hours at 4*C. Antibody/antigen complexes were isolated from supematants by 

5 incubation with 100 ^tl/sample rabbit and-mouse/Protein A-sepharose preparation 

described above. Beads were washed 4 times with 10 mM HEPES, 0.2 M NaCl, 
and 1% Triton-X 100. Washed beads were pelleted and boiled for 10 minutes in 
20 fil 2X Laemmli sample buffer with 2% /S-mercaptoethanol. Samples were 
centrifuged and run on an 8% prepoured Novex polyacrylamide gel (Novex) at 

10 100 V for 30 minutes. Protein was transferred to nitrocellulose membranes 
(Schleicher & SchudI) in TBS-T buffer at 200 mAmps for 1 hour. Membranes 
were blocked for 2 hr with 3% BSA in TBS-T. Membranes were treated with 
1:6000 dilution of Strep-avidin horse radish peroxidase (POD) (Boehringer 
Mannheim) for 1 hour, followed by 3 washes in TBS-T. The Amersham 

15 Enhanced Chcmiluminescence kit was then used according to the manufacturer's 

instructions to develop the blot. The membrane was exposed to Hyperfilm MP 
(Amersham) for 0.5 to 2 minutes, 

Immunoprecipitation of CD18 complexes from cells transfected 
with pRC.CDlS and either pATM.Bl, pATM.ClO or pATM.D12 revealed 

20 surface expression of a heterodimeric species consisting of approximately 100 kD 
jS chain, consistent with the predicted size of CD 18, and an a chain of 
approximately 150 kD, corresponding to a^. 

Example 11 

Stable Transfection of Human in Chinese Hamster Ovary Cells 
25 To determine whether ot^ is expressed on the cell surface as a 

heterodimer in association with CD18, cDNAs encoding each chain were both 
transiently and stably transfected into a cell line lacking both and CD18. 
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For these experiments, cDNA was augmented with additional 
leader sequences and a Kozak consensus sequence, as described in Example 7, 
and subcloned into expression vector pcDNA3. The final construct, designated 
pATM.D12, was co-transfected with a modified commercial vector, pDCl.CDlS 
encoding human CD 18 into dihydrofolate reductase (DHFR)" Chinese hamster 
ovary (CHO) cells. The plasmid pDCLCDlS encodes a DHFR*^ marker and 
transfectants can be selected using an appropriate nucleoside-deficient medium. 
The modifications which resulted in pDCl.CDlS are as follows. 

The plasmid pRC/CMV (Invitrogen) is a mammalian expression 
vector with a cytomegalovirus promoter and ampicillin resistance marker gene. 
A DHFR gene ft-om the plasmid pSCl 190-DHFR was inserted into pRC/CMV 5 ' 
of the SV40 origin of replication. In addition, a polylinker from the 5 ' region 
of the plasmid pHF2G-DHF was ligated into the pRC/CMV/DHFR construct, 3 ' 
to the DHFR gene. CD 18 encoding sequences are subsequently cloned into the 
resulting plasmid between the 5 ' flanking polylinker region and the bovine growth 
hormone poly A encoding region. 

Surface expression of CD 18 was analyzed by flow cytometry using 
the monoclonal antibody TSl/18. Heterodimer formation detected between 
and CD 18 in this cell line was consistent with the immunoprecipitation described 
in Example 10 with transient expression in COS cells. 

Elxample 12 

Human orj binds to ICAM-R in a CDlS-dependent fashion 

In view of reports that demonstrate interactions between the 
leukocyte integrins and intercellular adhesion molecules (ICAMs) which mediate 
cell-cell contact [Hynes, Cell 5P: 11-25 (1992)], the ability of CHO cells 
expressing ad/CD18 to bind ICAM-1, ICAM-R, or VCAM-1 was assessed by two 
methods. 
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In replicate assays, soluble ICAM-1, ICAM-R, or VCAM-1 IgGl 
fusion proteins were immobilized on plastic and the ability of ct^/CDlS CKO 
transfected cells to bind the immobilized ligand was determined. Transfected cells 
were labeled internally with calcein, washed in binding buffer (RPMI with 1% 
5 BSA), and incubated in either buffer only (with or without 10 ng/ml PMA) or 

buffer with anti-CD 18 monoclonal antibodies at 10 fig/rrd. Transfected cells were 
added to 96-well Immulon 4 microtiter plates previously coated with soluble 
ICAM-1/IgGl, ICAM-R/IgGl or VCAM-1/IgGl fusion protein, or bovine serum 
albumin (BSA) as a negative control. Design of the soluble forms of these 

10 adhesion molecules is described and fully disclosed in co-pending and co-owned 
U.S. Patent Application Serial No. 08/102,852, filed August 5, 1993. Wells were 
blocked with 1 % BSA in PBS prior to addition of labeled cells. After washing 
the plates by immersion in PBS with 0.1% BSA for 20 minutes, total fluorescence 
remaining in each well was measured using a Cytofluor 2300 (Millipore, Milford, 

15 MA). 

In experiments with immobilized ICAMs, a^CDlS co-transfectants 
consistently showed a 3-5 fold increase in binding to ICAM-R/IgGl wells over 
BSA coated wells. The specificity and CD18-dependence of this binding was 
demonstrated by the inhibitory effects of anti-CD18 antibody TSl/18. Thehindng 

20 of cells transfected with GDI la/CD18 to ICAM-1/IgGl wells was comparable to 
the binding observed with BSA coated wells. CDlla/CD18 transfected cells 
showed a 2-3 fold increase in binding to ICAM-1/IgGl wells only following 
pretreatment with PMA. PMA treatment of a^/CDlS transfectants did not affect 
binding to ICAM-1/IgGl or ICAM-R/IgGl wells. No detectable binding of 

25 Qr^j/CD18 transfectants to VCAM-1 /IgGl wells was observed. 

Binding of ad/CD18-transfected cells to soluble ICAM-1 /IgGl, 
ICAM-R/IgGl, or VCAM-1 /IgGl fusion proteins was determined by flow 
cytometry. Approximately one million a^i/CDlS-transfected CHO cells (grown in 
spinner flasks for higher expression) per measurement were suspended in 100 /xl 
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binding buffer (RPMI and 1% BSA) with or without 10 /ig/ml anti-CD18 
antibody. After a 20 minute incubation at room temperature, the cells were 
washed in binding buffer and soluble ICAM-1/IgGl or ICAM-R/IgGl fusion 
protein was added to a final concentration of 5 fig/ml. Binding was allowed to 
5 proceed for 30 minute at 37°C, after which the cells were washed three times and 

resuspended in 100 /xl binding buffer containing FTTC-conjugated sheep anti- 
human IgGl at a 1:100 dilution. After a 30 minute incubation, samples were 
washed three times and suspended in 200 fil binding buffer for analysis with a 
Becton Dickinson FACScan. 

10 Approximately 40-50% of the aj/CD18 transfectants indicated 

binding to ICAM-R/IgGl, but no binding to ICAM-1/IgGl or VCAM-1/IgGl 
proteins. Pretrcatmcnt of transfected cells with PMA has no effect on a^CDlS 
binding to either ICAM-1/IgGl, ICAM-R/IgGl or VCAM-1/IgGl, which was 
consistent with the immobilized adhesion assay. Binding by ICAM-R was 

15 reduced to background levels after treatment of a^/CD18 transfectants with anti- 

CD18 anubody TSl/18. 

The collective data from these two binding assays illustrate that 
Qfj/CDIS binds to ICAM-R and does so preferentially as compared to ICAM-1 
and VCAM-1. The a^CDlS binding preference for ICAM-R over ICAM-1 is 

20 opposite that observed with CD 11 a/CD 18 and CD 1 lb/CD 18. Thus modulation 

of a^j/CD18 binding may be expected to selectively affect normal and pathologic 
immune function where ICAM-R plays a prominent role. Moreover, results of 
similar assays, in which antibodies immunospecific for various extracellular 
domains of ICAM-R were tested for their ability to inhibit binding of ICAM-R 

25 to aj/CD18 transfectants, indicated that a^/CD18 and CDl la/CD18 interact with 

different domains of ICAM-R. 

The failure of CDlla/CD18 to bind ICAM-1/IgGl or ICAM- 
R/IgGl in solution suggests that the affinity of binding between CDlla/CD18 and 
ICAM-1 or ICAM-R is too low to permit binding in solution. Detection of 
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aj/CD18 binding to ICAM-R/IgGl, however, suggests an unusually high binding 
affinity. 

Binding to iC3b 

Complement component C3 can be proteolytically cleaved to form 
5 the complex iC3b, which initiates the alternative pathway of complement 
activation and leads ultimately to cell-mediated destruction of a target. Both 
CD lib and CD 11c have been implicated in iC3b binding and subsequent 
phagocytosis of iC3b-coated particles. A peptide fragment in the CDl lb I domain 
has recently been identified as the site of iC3b interaction [Ueda, et ah, 
10 Proc.NatLAcad.ScL (USA) Pi: 10680- 10684 (1994)]. The region of iC3b bindmg 

is highly conserved in CDl lb, CDllc, and aj, suggesting an a^/iC3b binding 
interaction. 

Binding of oc^ to iC3b is performed using transfectants or cell lines 
naturally expressing (for example, PMA-stimulated HL60 cells) and iC3b- 
15 coated sheep red blood cells (sRBC) in a rosette assay [Dana, et aL , /. ClinJnvesL 
7i: 153-159 (1984)]. The abilities of aj/CD18 CHO transfectants, VLA4-CHO 
transfectants (negative control) and PMA-stimulated HL60 cells (positive control) 
to form rosettes are compared in the presence and absence of an anti-CD 18 
monoclonal antibody (for example TS 1/1 8.1). 

20 Example 13 

Screening by Scintillation Proximity Assay 

Specific inhibitors of binding between the ligands of the present 
invention and their binding partners (qtj ligand/anti-ligand pair) may be 
determined by a variety of means, such as scintillation proximity assay techniques 
25 as generally described in U.S. Patent No. 4,271,139, Hart and Greenwald, 

MoLImmunoL i2:265-267 (1979), and Hart and Greenwald, J.Nuc.Med, 
20:1062-1065 (1979), each of which is incorporated herein by reference. 
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Briefly, one member of the ligand/anti-ligand pair is bound to 
a solid support. A fluorescent agent is also bound to the support. Alternatively, 
the fluorescent agent may be integrated into the solid support as described in U.S. 
Patent No. 4,568,649, incorporated herein by reference. The non-support bound 
member of the a ^ ligand/anti-ligand pair is labeled with a radioactive compound 
that emits radiation capable of exciting the fluorescent agent. When the ligand 
binds the radiolabeled anti-ligand, the label is brought sufficiently close to the 
support-bound fluorescer to excite the fluorescer and cause emission of light. 
When not bound, the label is generally too distant from the solid support to excite 
the fluorescent agent, and light emissions are low. The emitted light is measured 
and correlated with binding between the ligand and the anti-ligand. Addition of 
a binding inhibitor to the sample will decrease the fluorescent emission by keeping 
the radioactive label from being captured in the proximity of the solid support. 
Therefore, binding inhibitors may be identified by their effect on fluorescent 
emissions from the samples. Potential anti-ligands to may also be identified 
by similar means. 

Example 14 

Soluble Human Expression Constructs 

The expression of full-length, soluble human aj/CD18 
heterodimeric protein provides easily purified material for immunization and 
binding assays. The advantage of generating soluble protein is that it can be 
purified from supematants rather than from cell lysates (as with full-length 
membrane-bound a^/CDlZ); recovery in therefore improved and impurities 
reduced. 

The soluble oc^ expression plasmid was constructed as follows. A 
nucleotide fragment corresponding to the region from bases 0 to 3161 in SEQ ID 
NO: 1, cloned into plasmid pATM.D12, was isolated by digestion with HindOl 
and Aatll, A PGR fragment corresponding to bases 3130 to 3390 in SEQ ID NO: 
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1, overlapping the HindSSUAafll fragment and containing an addition Mlul 
restriction site at the 3" terminus, was amplified from pATM.B12 with primers 
sHAD.5 and sHAD.3 set out in SEQ ID NOS: 30 and 31, respectively. 

5 '.TrGCTGACTGCCTGCAGTTC-3 ' (SEQ ID NO: 30) 

5 5 '.GTrCTGACGCGTAATGGCATTGTAGACCTCGTCnTC(SEQ ID NO: 31) 

The PGR amplification product was digested with AatU and Mlul and ligated to 
the Hifunn/Aatn fragment. The resulting product was ligated into HindRVMlul" 
digested plasmid pDCl.s. 

This construct is co-expressed with soluble CD 18 in stably 
10 transfected CHO cells, and expression is detected by autoradiographic 

visualization of immunoprecipitated CD 18 complexes derived from ^^S-methionine 
labeled cells. The construct is also co-expressed with CD 18 in 293 cells 
[Berman, etal, J.CelLBiochem. 52:183-195 (1993)]. 

Soluble Human ot^ T Domain Expression Constructs 

15 It has previously been reported that the I domain in CDl la can be 

expressed as an independent structural unit that maintains ligand binding 
capabilities and antibody recognition [Randi and Hogg, J.BioLChem. 25P: 12395- 
12398 (1994); Zhout, et al, J.BioLChem. 25P: 17075-17079 (1994); Michishita, 
et aL, Cell 72:857-867 (1993)]. To generate a soluble fusion protein comprising 

20 the I domain and human IgG4, the I domain is amplified by PCR using 

primers designed to add flanking BamKl and Xhol restriction sites to facilitate 
subcloning. These primers are set out in SEQ ID NOS: 32 and 33 with restriction 
sites underlined. 

5 -ACGTATGCA GGATCCC ATCAAGAGATGGACATCGCTSEO ID NO: 32) 
25 5 '-ACTGCATGTCICeAGGCTGAAGCCTTCTTGGGACAXSHJ ID NO: 33) 
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The C nucleotide immediately 3' to the BamHl site in SEQ ID NO: 32 
corresponds to nucleotide 435 in SEQ ID NO: 1; the G nucleotide 3 ' to tiieXhol 
site in SEQ ID NO: 33 is complementary to nucleotide 1067 in SEQ ID NO: 1, 
The amplified I domain is digested with the appropriate enzymes, the purified 
5 fragment ligated into the mammalian expression vector pDCs and the prokaryotic 

expression vector pGEX-4T-3 (Pharmacia) and the I domain fragment sequenced. 
The fusion protein is then expressed in COS, CHO or E.coli cells transfected or 
transformed with an appropriate expression construct. 

Given the affinity of or^ for ICAM-R, expression of the I 
10 domain may be of sufficient affinity to be a useful inhibitor of cell adhesion in 

which participates. 

Analysis of Human I Domain/IgG4 Fusion Proteins 

Protein was resolved by SDS-PAGE under reducing and non- 
reducing conditions and visualized by either silver staining or Coomassie staining. 
15 Protein was then transferred to Immobilon PVDF membranes and subjected to 

Western blot analysis using anti-human IgG monoclonal antibodies or anti-bovine 
Ig monoclonal antibodies. 

Protein detected was determined to migrate at about 120 kD under 
non-reducing conditions and at about 45 kD under reducing conditions. Minor 
20 bands were also detected on non-reducing gels at approximately 40-50 kD which 

were reactive with the anti-human, but not anti-bovine, antibodies. A 200 kD 
minor band was determined to be bovine Ig by Western blot. 

Binding Assays Using I Domain Expression Products 

The ability of the I domain to specifically recognize ICAM-R/IgG 
25 chimeric protein was tested in an ELISA format. Serial dilutions of a^i 

I domain IgG4 fusion protein (Ia^/IgG4) in TBS were incubated with ICAM- 
1/IgG, ICAM-R/IgG, VCAM-l/IgG, or an irrelevant IgGl myeloma protein 
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immobilized on Immulon IV RIA/EIA plates. CDlla I domain/IgG chimeric 
protein and human IgG4/kappa myeloma protein were used as negative controls- 
Bound IgG4 was detected with the biotinylated anti-IgG4 monoclonal antibody 
HP6023 followed by addition of strepavidin-peroxidase conjugate and development 
5 with substrate o-phenyldiamine. 

In repeated assays, no binding of the CDlla/IgG4 protein or the 
IgG4 myeloma protein was detected with any of the immobilized proteins. The 
IayigG4 protein did not bind to fish skin gelatin or bovine serum albumin 
blocking agents, human IgGl, or ICAM-l/IgG. A two to three fold increase in 

10 binding signal over background was detected in ICAM-R/IgG protein coated wells 

using 1-5 ftg/ml concentrations of lQr^/IgG4 protein. The signal in VCAM-l/IgG 
protein coated wells was 7-10 fold higher than background. In previous assays, 
orj/CD18 transfected CHO cells did not bind VCAM-l/IgG protein, suggesting 
that VCAM-1 binding may be characteristic of isolated I domain amino acid 

15 sequences. 

Additional a ^ I domain constructs 

Additional a ^ I domain constructs are generated in the same fashion 
as the previous construct, but incorporating more amino acids around the I 
domain. Specific constructs include: i) sequences from exon 5 (amino acids 127- 

20 353 in SEQ ID NO: 2), preceding the current construct, ii) the EF-hand repeats 

(amino acids 17-603 in SEQ ID NO: 2) following the I domain, and iii) the alpha 
chain truncated at the transmembrane region (amino acids 17-1029 in SEQ ID 
NO: 2), with an IgG4 tail for purification and detection purposes. These 
constructs are ligated into either the mammalian expression vector pDCSl or the 

25 prokaryotic expression vector pGEX-4T-3 (Pharmacia) and the I domain 
sequenced. The fusion proteins are then be expressed in COS, CHO, or E.coli 
cells transformed or transfected with an appropriate expression construct. Protein 
are purified on a ProSepA column (Bioprocessing Limited, Durham, England), 



BNSDOCID: <WO 9517412A1J_> 



wo 95/17412 



PCTAJS94/14832 



-44- 

tested for reactivity with the anti-IgG4 monoclonal antibody HP6023 and 
visualized on polyacrylamide gels with Coomassie staining. 

In order to construct an expression plasmid for the entire 
polypq)tide, pATM.D12, described supra ^ is modified to express an ct^\%GA 
5 fusion protein by the following method. IgG4 encoding DNA is isolated from the 

vector pDCSl by PGR using primers which individually incorporate a 5 ' AatH 
restriction site (SEQ ID NO: 89) and a 3 ' Xha\ restriction site (SEQ ID NO: 90). 

5 '-CGCTGTGACGTCAGAGTTGAGTCCAAATATGG-3 ' (SEQ ID NO: 89) 
5 '-GGTGACACTATAGAATAGGGC-3 ' (SEQ ID NO: 90) 

10 Plasmid pATM.D12 is digested with AoxYL and Xba\, and the appropriately 

digested and purified IgG4 PGR product iigated into the linear vector. 
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Example 15 

Production of Human g^ j -Specific Monoclonal Antibodies 

Transiently txansfected cells from Example 7 were washed three 
times in Dulbecco's phosphate buffered saline (D-PBS) and injected at 5 x 10^ 

5 cells/mouse into Balb/c mice with 50 ftg/mouse muramyl dipeptidase (Sigma) in 

PBS. Mice were injected two more times in the same fashion at two week 
intervals. The pre-bleed and immunized serum from the mice were screened by 
FACS analysis as outlined in Example 9 and the spleen from the mouse with the 
highest reactivity to cells transfected with a^CDlS was fused. Hybridoma 

10 culture supematants were then screened separately for lack of reactivity against 
COS cells transfected with CDlla/CD18 and for reactivity with cells co- 
transfected with an expression plasmid and CD 18. 

This method resulted in no monoclonal antibodies. 

As an alternative for production of monoclonal antibodies, soluble 

15 a^l domain IgG4 fusion protein was affinity purified from supernatant of stably 

transfected CHO cells and used to immunize Balb/c mice as described above, 
Hybridomas were established and supematants from these hybridomas were 
screened by ELISA for reactivity against a^j I domain fusion protein. Positive 
cultures were then analyzed for reactivity with full length a^/CDlS complexes 

20 expressed on CHO transfectants. 

Mouse 1908 received three initial immunizations of a^j/CDlS 
transfected CHO cells and two subsequent boosts with soluble of^/CDlS 
heterodimer. Two final immunizations included 50 /xg/mouse lQ:^/IgG4 fusion 
protein. The fusion produced 270 IgG-producing wells. Supernatant from 45 

25 wells showed at least 7-fold higher binding to lQfj/IgG4 fusion protein than to 

human IgG4 by ELISA. None of the supematants reacted to a^/CDlS transfected 
CHO cells as determined by FACS analysis. 

To determine whether the supematants were able to recognize 
integrin alpha subunit proteins in another context, fresh frozen splenic sections 
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were stained with supematants from 24 of the 45 wells. Three supematants were 
determined to be positive: one stained large cells in the red pulp, while two others 
stained scattered cells in the red pulp and also trabeculae. 

These supematants were fiirther analyzed by their ability to 
5 immunopredpitate biotinylated CDl 8 complexes from either a^/CD 1 8 transfected 

CHO cells or PMA-stimulated HL60 cells. Fusion wells with supematants that 
recognized protein in detergent lysates (which should not be as conformationally 
constrained as protein expressed as heterodimers) were selected for further 
subcloning. Monoclonal antibodies which recognize protein in detergent may be 

10 more useful in immunoprecipitation of heterodimeric complexes from 

transfectants, tissues, and cell lines. 

As another alternative, monoclonal antibodies are generated as 
follows. Affinity purified a^/CDlS heterodimeric protein from detergent lysates 
of stably transfected CHO cells is used with 50 /zg/ml muramyl dipeptidase to 

15 immunize Balb/c mice as described above. Mice receive three immunizations 

before serum reactivity against a^CDlS is determined by immunoprecipitation 
of biotinylated complexes in the CHO transfectants. Hybridomas from positive 
animals are established according to standard protocols, after which hybridoma 
cultures are selected by flow cytometry using a^j/CDlS transfectants. 

20 CDlla/CD18 transfectants are utilized to control for CD18-only reactivity. 

As another alternative for monoclonal antibody production, Balb/c 
mice undergo an immunization/immunosuppression protocol designed to reduce 
reactivity to CHO cell determinants on transfectants used for immunization. This 
protocol involves immunization with untransfected CHO cells and subsequent 

25 killing of CHO-reactive B-cell blasts with cyclophosphamide treatment. After 

three rounds of immunization and cyclophosphamide treatment are performed, the 
mice are immunized with a^/CDlS CHO transfected cells as described above. 

As still another alternative, heterodimeric CD 18 complexes are 
immunoprecipitated from detergent lysates of whole spleen using an anti-CD 18 
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monoclonal antibody, following preclearance of CDlla/CD18 and CDllb/CD18. 
CDlia/'CDlS and CDllb/CD18 complexes are precleared by affinity 
chromatography using monoclonal antibodies TS2/4 and Mol, respectively, 
coupled to a chromatographic resin. The remaining CD 18 complexes are used as 

5 an immunogen in Balb/c mice for the first immunization. Three immunizations 

are given at three week intervals, the initial immunization administered in 
conjunction with Freund's Complete Adjuvant and the subsequent immunizations 
with Freund's Incomplete Adjuvant. Serum is assayed for of^-specific reactivity 
by immunoprecipiution. Resulting hybridomas are screened by flow cytometry 

10 with a^/CD18 CHO transfectants. 

As another alternative, CD 18 complexes from detergent lysates of 
PMA stimulated HL60 cells are enriched by preclearance as described above. 
Other /S2 inicgrins are cleared on the same columns. Immunization with the 
. resulting complexes, hybridoma production, and screening protocols are 

15 performed as described supra. 

Example 16 

Ana1yf;i5; of ffj distribution with polvclonal serum 

Tissue distribution of a^lCDl% was determined using polyclonal 
antiserum. Antiserum used to stain tissue was obtained from a mouse immunized 

20 3 times with transfected CHO cells (D6.CHO, a^/CDlS) with adjuvant peptide 

and once with purified ofa/CD18 heterodimer. A final boost included only 
a^/CDlS heterodimer. Approximately 100 ftl immunized serum was precleared 
by addition of approximately 10^ LFA-1 -transfected CHO cells for 2 hours at 
4°C. The resulting serum was assayed for reactivity at dilutions of 1/5000, 

25 1/10000, 1/20000 and 1/40000 on normal human spleen. The polyclonal antibody 

was reactive at a dilution of 1/20000, while a 1/40000 dilution stained very 
weakly. 
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Once serum was determined to have specific reactivity, it was 
used to stain various lymphoid and non-lymphoid tissues. Monoclonal antibodies 
recognizing CD18, CDlla, CDllb, and CDllc were used in the same 
exi)eriment as controls. Staining of normal spleen sections with polyclonal 
sera, and monoclonal antibodies to CDlla, CDllb, CDllc, and CD18 revealed 
the following results. The pattern observed with polyclonal sera did not 
display the same pattern of labeling as CDlla, CDllb, CDllc, or CD18. There 
is a distinct pattern of labeling with some cells located in the marginal zone of the 
white pulp and a distinct labeling of cells peripheral to the marginal zone. This 
pattern was not observed with the other antibodies. Individual cells scattered 
throughout the red pulp were also labeled which may or may not be the same 
population or subset seen with CDlla and CD18. 

Labeling with CDllc did display some cells staining in the 
marginal zone, but the antibody did not show the distinct ring pattern around the 
white pulp when compared to jxjlyclonal sera, nor did labeling in the red pulp 
give the same pattern of staining as polyclonal sera. 

Therefore, the labeling pattern seen with polyclonal serum was 
unique compared to that seen using antibodies to the other integrins (CDlla, 
CDllb, CDllc, and CD18), and suggests that the in vivo distribution of in 
man is dinstinct from that of other 02 integrins. 

Example 17 

Isolation of Rat cDNA Clones 

In view of the existence of both canine and human subunits, 
attempts were made to isolate homologous genes in other species, including rat 
(this example) and mouse (Example 17, injra), 

A partial sequence of a rat cDNA showing homology to the human 
gene was obtained from a rat splenic XgtlO library (Clontech). The library 
was plated at 2 x 10^ pfu/plate onto 150 mm LBM/agar plates. The library was 
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lifted onto Hybond membranes (Amersham), denatured 3 minutes, neutralized 3 
minutes and washed 5 minutes with buffers as described in standard protocols 
[Sambrook, et al. Molecular Cl oning: a laboratory manual, p.2.110]. The 
membranes were placed immediately into a Stratalinker (Stratagene) and the DNA 

5 crosslinked using the autocrosslinking setting. The membranes were 
prehybridized and hybridized in 30% or 50% formamide, for low and high 
stringency conditions, respectively. Membranes were initially screened with a 
^^P-labeled probe generated from the human cDNA, corresponding to bases 
500 to 2100 in clone 19A2 (SEQ ID NO: 1). The probe was labeled using 

10 Boehringer Mannheim's Random Prime Kit according to manufacturer's suggested 
protocol. Filters were washed with 2X SSC at 55 ""C. 

Two clones, designated 684.3 and 705.1, were identified which 
showed sequence homology to human aj, human CDllb, and human CDllc. 
Both clones aligned to the human gene in the 3 ' region of the gene, starting 

15 at base 1871 and extending to base 3012 for clone 684.3, and bases 1551 to 3367 

for clone 705.1. 

In order to isolate a more complete rat sequence which included the 
5 ' region, the same library was rescreened using the same protocol as employed 
for the initial screening, but using a mouse probe generated from clone A 1160 
20 (See Example 17, infra). Single, isolated plaques were selected from the second 
screening and maintained as single clones on LBM/agar plates. Sequencing 
primers 434FL and 434FR (SEQ ID NOS: 34 and 35, respectively) were used in 
a standard PGR protocol to generate DNA for sequencing. 

5 '-TATAGACTGCTGGGTAGTCCCCAC-3 ' (SEQ ID NO: 34) 

25 5 '-TGAAGATTGGGGGTAAATAACAGA-3 ' (SEQ ID NO: 35) 

DNA from the PGR was purified using a Quick Spin Column (Qiagen) according 
to manufacturer's suggested protocol. 
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Two clones, designated 741.4 and 741.11, were identified which 
overlapped clones 684.3 and 705.1; in the overlapping regions, clones 741.1 and 
741.11 were 100% homologous to clones 684.3 and 705.1. A composite rat 
cDNA having homology to the human gene is set out in SEQ ID NO: 36; the 
predicted amino acid sequence is set forth in SEQ ID NO: 37. 

Cloning of the 5 ' en d of Rat 

A 5 * cDNA fragment for the rat gene was obtained using a 
Clonetech rat spleen RACE cloning kit according to manufacturer's suggested 
protocol. The gene specific oligonucleotides used were designated 741. 1 1#2R and 
741.2#1R (SEQ ID NOS: 59 and 58, respectively). 

5 -CCAAAGCTGGCTGCATCCTCTC-3 ' (SEQ ID NO: 59) 

5 -GGCCTTGCAGCTGGACAATG-3 ' (SEQ ID NO: 58) 

Oligo 741.11#2R encompasses base pairs 131-152 in SEQ ID NO: 36, in the 
reverse orientation and 741.2#1R encompasses bases pairs 696-715 in SEQ ID 
NO: 36, also in the reverse orientation. A primary PCR was carried out using 
the 3 '-most oligo, 741,2#1R. A second PCR followed using oligo 741.11#2R 
and DNA generated from the primary reaction. A band of approximately 300 
base pairs was detected on a 1 % agarose gel. 

The secondary PCR product was ligated into plasmid pCRTAII 
(Invitrogen) according to manufacturer's suggested protocol. White (positive) 
colonies were picked and added to 100 fil LBM containing 1 ^1 of a 50 mg/ml 
carbenicillin stock solution and 1 /zl M13 K07 phage culture in individual wells 
in a round bottom 96 well tissue culture plate. The mixture was incubated at 
37*'C for 30 minutes to one hour. Following the initial incubation period, 100 /xl 
of LBM (containing 1 ^xl of 50 mg/ml carbenicillin and a 1:250 dilution of a 10 
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mg/ml kanamycin stock solution) were added and the incubation was continued 

overnight at 37 °C. 

Using a sterile 96 well metal transfer prong, supernatant from the 

96 well plate was transferred to four Amersham Hybond nylon filters. The filters 
5 were denatured, neutralized and cross linked by standard protocols. The filters 

were prehybridized in 20 mis of prehybridization buffer (5X SSPE; 5X 

Denhardts; 1% SDS; 50 ugs/ml denatured salmon sperm DNA) at 50^0 for 

several hours while shaking. 

OUgo probes 74L11#1 and 741.11#1R (SEQ ID NOS: 56 and 57, 
10 respectively), encompassing base pairs 86-105 (SEQ ID NO: 36) in the forward 

and reverse orientation respectively, were labeled as follows. 

5 '-CCTGTCATGGGTCTAACCTG-3 ' (SEQ ID NO: 56) 

5 -AGGTTAGACCCATGACAGG-B ' (SEQ ID NO: 57) 

Approximately 65 ng oligo DNA in 12 fi\ dH20 was heated to 65oC for two 
15 minutes. Three fil of 10 mCi/ml 7-^^P-ATP were added to the tube along with 

4 ^1 5x Kinase Buffer (Gibco) and 1 fxl T4 DNA Kinase (Gibco). The mixture 
was incubated at 37oC for 30 minutes. Following incubation, 16 nl of each 
labeled oligo probe were added to the prehybridization buffer and filters and 
hybridization was continued overnight at 42oC. The filters were washed three 
20 times in 5X SSPE; 0. 1 % SDS for 5 minutes per wash at room temperature, and 
autoradiographed for 6 hours. Positive clones were expanded and DNA purified 
using the Magic Mini Prep Kit (Promega) according to manufacturer's suggested 
protocol. Clone 2F7 was selected for sequencing and showed 100% homology 
- clone 741.11 in the overlapping region. The complete rat nucleic acid 
25 sequence is set out in SEQ ID NO: 54; the amino acid sequence is set out in SEQ 

ID NO: 55. 
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Characteristics of the Rat cDNA and Amino Acid Sequences 

Neither nucleic acid nor amino acid sequences have previously been 
reported for rat a subunits in ^2 integrins. However sequence comparisons to 
reported human ^2 integrin a subunits suggests that the isolated rat clone and its 
predicted amino acid sequence are most closely related to nucleotide and amino 
acid sequences. 

At the nucleic acid level, the isolated rat cDNA clone shows 80% 
identity in comparison to the human cDNA; 68% identity in comparison to 
human CDllb; 70% identity in comparison to human CDllc; and 65% identity 
in comparison to mouse CDllb. No significant identity is found in comparison 
to human CDlla and to mouse CDlla. 

At the amino acid level, the predicted rat polypeptide encoded by 
the isolated cDNA shows 70% identity in comparison to human polypeptide; 
28% identity in comparison to human CDlla; 58% identity in comparison to 
human CDllb; 61% identity in comparison to human CDllc; 28% identity in 
comparison to mouse CDlla; and 55% identity in comparison to mouse CDllb, 

Elxample 18 

Monoclonal Antibod ies against Rat rv^ T domain/Hu TpG4 Fusion Proteins 

In view of the fact that the I domain of human ^2 integrins has 
been demonstrated to participate in ligand binding, it was assumed that the same 
would be true for rat protein. Monoclonal antibodies immunospecific for the 
rat I domain may therefore be useful in rat models of human disease states 
wherein binding is implicated. 

Oligos "rat alpha-DI5" (SEQ ID NO: 87) and "rat alpha-DI3" (SEQ 
ID NO: 88) were generated from the rat sequence corresponding to base pairs 
469-493 and base pairs 1101-1125 (in the reverse orientation), respectively, in 
SEQ ID NO: 54. The oligos were used in a standard PCR reaction to generate 
a rat DNA fragment containing the I domain spanning base pairs 459-1125 in 
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SEQ ID NO: 54. The PCR product was ligated into vector pCRTAII (Invitrogen) 
according to manufacturer's suggested protocol. A positive colony was selected 
and expanded for DNA purification using a Qiagen (Chatswoth, GA) Midi Prep 
kit according to manufacturer's protocol. The DNA was digested with Xhol and 
5 Bgin in a standard restriction enzyme digest and a 600 base pair band was gel 

purified which was subsequently ligated into pDCSl/HuIgG4 expression vector. 
A positive colony was selected, expanded and DNA purified with a Quiagen Maxi 
Prep Kit. 

COS cells were plated at half confluence on 100mm culture dishes 
10 and grown overnight at 37oC in 7% CO2. Cells were rinsed once with 5 ml 

DMEM. To 5 ml DMEM, 50 /xl DEAE-Dextran, 2 /xl chloroquine and 15 fig rat 
I domain/HulgG4 DNA described above was added. The mixture was added 
to the COS cells and incubated at 37 for 3 hours. Media was then removed 
and 5 ml 10% DMSO in CMF-PBS was added for exactly one minute. The cells 
15 were gently rinsed once with DMEM. Ten ml DMEM containing 10% FBS was 

added to the cells and incubation continued overnight at 37oC in 7% CO2. The 
next day, media was replaced with fresh media and incubation continued for three 
additional days. The media was harvested and fresh media was added to the 
plate. After three days, the media was collected again and the plates discarded. 
20 The procedure was repeated until 2 liters of culture supernatant were collected. 

Supernatant collected as described above was loaded onto a 
Prosep-A column (Bioprocessing Limited) and protein purified as described 
below. 

The column was initially washed with 15 column volumes of Wash 
25 Buffer containing 35 mM Tris and 150 mM NaCl, pH 7.5. Supernatant was 

loaded at a slow rate of less than approximately 60 column volumes per hour. 
After loading, the column was washed with 15 column volumes of Wash Buffer, 
15 column volumes of 0.55 M diethanolamine, pH 8.5, and 15 column volumes 
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50 mM citric acid, pH 5,0. Protein was eluted with 50 mM citric acid, pH 3.0. 
Protein was neutralized with 1.0 M Tris, pH 8.0, and dialyzed in sterile PBS. 

The rat or ^ I domain protein was analyzed as described in Example 
14. The detected protein migrated in the same manner as observed with human 
I domain protein. 

Immunization Protocol 

Mice were individually immunized with 50 fig purified rat I 
domain/HulgG4 fusion protein previously emulsified in an equal volume of 
Freunds Complete Adjuvant (FCA) (Sigma), Approximately 200 ^1 of the 
antigen/adjuvant preparation was injected at 4 sites in the back and flanks of each 
of the mice. Two weeks later the mice were boosted with an injection of 100 ^1 
rat ofjj I domain/HuIgG4 antigen (50 /xg/mouse) previously emulsified in an equal 
volume of Freunds Incomplete Adjuvant (FIA). After two additional weeks, the 
mice were boosted with 50 fig antigen in 200 ^1 PBS injected intravenously. 

To evaluate serum titers in the immunized mice, retro-orbital bleeds 
were performed on the animals ten days following the third immunization. The 
blood was allowed to clot and serum isolated by centrifugation. The serum was 
used in an immunoprecipitation on biotinylated (BIP) rat splenocytes. Serum 
from each mouse immunoprecipitated protein bands of expected molecular weight 
for rat and rat CD 18. One mouse was selected for the fusion and was boosted 
a fourth time as described above for the third boost. 

The hybridoma supematants were screened by antibody capture, 
described as follows. Immulon 4 plates (Dynatech, Cambridge, Massachusetts) 
were coated at 4''C with 50 /xl/well goat anti-mouse IgA, IgG or IgM (Organon 
Teknika) diluted 1:5000 in 50 mM carbonate buffer, pH 9.6. Plates were washed 
3X with PBS containing 0.05% Tween 20 (PBST) and 50 fil culture supernatant 
was added. After incubation at 37 "^C for 30 minutes, and washing as described 
above, 50 fil horseradish peroxidase-conjugated goat anti- mouse IgG9(Fc) 
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(Jackson ImmunoResearch, West Grove, Pennsylvania) diluted 1:3500 in PBST 
was added. Plates were incubated as described above and washed 4X with PBST. 
Immediately thereafter, lOO/il substrate, containing 1 mg/ml o-phenylene diamine 
(Sigma) and 0.1 ixUml 30% H2O2 in 100 mM citrate, pH4.5, was added. The 

5 color reaction was stopped after 5 minutes with the addition of 50 /xl 15 % H2SO4. 

Absorbance at 490 nm was read on a Dynatech plate reader. 

Supernatant from antibody-containing wells was also analyzed by 
ELISA with immobilized rat I domain/HulgG4 fusion protein. An ELISA with 
HuIgG4 antibody coated plates served as a control for reactivity against the IgG 

10 fusion partner. Positive wells were selected for further screening by BIP on rat 
splenocyte lysates using techniques described below. 

Biotinvlation of Cell Surf ace Antigens 

Rats were sacrificed by asphyxiation with CO2 and spleens were 

removed using standard surgical techniques. Splenocytes were harvested by 
15 gently pushing the spleen through a wire mesh with a 3 cc syringe plunger in 20 

mis RPMI, Cells were collected into a 50 ml conical tube and washed in the 

appropriate buffer. 

Cells were washed three times in cold D-PBS and resuspended at 

a density of 10^ to l(f cells in 40 ml PBS. Four mg of NHS-Biotin (Pierce) was 
20 added to the cell suspension and the reaction was allowed to continue for exactly 

15 minutes at room temperature. The cells were pelleted and washed three times 

in cold D-PBS. 

Cell Lvsates 

Cells were resuspended at a density of 10^ cells/ml in cold lysis 
25 Buffer (1 % NP40; 50 mM Tris-HCl, pH 8.0; 150 mM NaCl; 2 mM CaCl; 2 mM 
MgCl; 1:100 solution of pepstain, leupeptine, and aprotinin, added just before 
adding to cells; and 0.0001 g PMSF crystals, added just before adding to cells). 
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Lysates were vortexed for approximately 30 seconds, incubated for 5 minute at 
room temperature, and further incubated for 15 minutes on ice. Lysates were 
centrifuged for 10 minutes at 10,000 xg to peUet the insoluble material. 
Supernatant was collected into a new tube and stored at between 4°C and -lOoQ. 

Immunoprecipitation 

One ml cell lysate was precleared by incubation with 200 fil of a 
protein A sepharose slurry (Zymed) overnight at 4^*0, Precleared lysate was 
aliquoted into Eppendorf tubes at 50 fil/tuhc for each antibody to be tested. 
Twenty-five /xl of polyclonal serum or 100 to 500 of monoclonal antibody 
supernatant were added to the precleared lysates and the resulting mixture 
incubated for 2 hours at 4''C with rotation. One hundred ^1 rabbit anti-mouse 
IgG (Jackson) bound to protein A sepharose beads in a PBS slurry was then added 
and incubation continued for 30 minutes at room temperature with rotation. 
Beads were pelleted with gentle centrifugation, and washed three times with cold 
Wash Buffer (10 mM HEPES; 0.2 M NaCl; 1% Trition X-100). Supernatant 
was removed by aspiration, and 20 ^1 2X SDS sample buffer containing 10% 
^-mercaptoethanol was added. The sample was boiled for 2 minutes in a water 
bath, and the sample loaded onto a 5% SDS PAGE gel. Following separation, 
the proteins were transferred to nitrocellulose at constant current overnight. The 
nitrocellulose filters were blocked with 3% BSA in TBS-T for 1 hour at room 
temperature and the blocking buffer was removed. A 1:6000 dilution of 
Strepavidin-HRP conjugate (Jackson) in 0.1% BSA TBS-T was added and 
incubation continued for 30 minutes at room temperature. Filters were washed 
three times for 15 minutes each with TBS-T and autoradiographed using 
Amersham's ECL kit according to manufacturer's suggested protocol. 



wo 95/17412 



PCTA3S94/14832 



-57 - 
Example 19 

Isolation of Mouse cDNA Clones 

Isolation of a mouse homolog was attempted. 

Cross-species hybridization was performed using two PGR- 
5 generated probes: a 1.5 ld> fragment corresponding to bases 522 to 2047 from 

human clone 19A2 (SEQ ID NO: 1), and a LO kb rat fragment which corresponds 
to bases 1900 to 2900 in human clone 19A2 (SEQ ID NO: 1). The human probe 
was generated by PCR using primer pairs designated ATM-2 and 9-10. 1 set out 
in SEQ ID NOS: 38 and 39, respectively; the rat probe was generated using 
10 primer pairs 434L and 434R, set out in SEQ ID NOS: 34 and 35, respectively. 

Samples were incubated at 94 ''C for 4 minutes and subjected to 30 cycles of the 
temperature step sequence: 94°C; 50°C 2 minutes; 72^C, 4 minutes, 

5'-GTCCAAGCTGTCATGGGCCAG-3' (SEQ ID NO: 38) 

5 '-GTCCAGCAGACTGAAGAGCACGG-3 ' (SEQ ID NO: 39) 

15 The PCR products were purified using the Qiagen Quick Spin kit 

according to manufacturer's suggested protocol, and approximately 180 ng DNA 
was labeled with 200 fiCi [^^P]-dCTP using a Boehringer Mannheim Random 
Primer Labeling kit according to manufacturer's suggested protocol. 
Unincorporated isotope was removed using a Centri-sep Spin Column (Princeton 

20 Separations, Adelphia, NJ) according to manufacturer's suggested protocol. The 
probes were denatured with 0.2 N NaOH and neutralized with 0.4 M Tris-HCl, 
pH 8.0, before use. 

A mouse thymic oligo dT-primed cDNA library in lambda ZAP II 
(Stratagene) was plated at approximately 30,000 plaques per 15 cm plate. Plaque 

25 lifts on nitrocellulose filters (Schleicher & Schuell, Keene, NH) were incubated 

at 50 "^C with agitation for 1 hour in a prehybridization solution (8 ml/lift) 
containing 30% formamide. Labeled human and rat probes were added to the 
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prehybridization solution and incubation continued overnight at 50**C. Filters 
were washed twice in 2X SSC/0. 1 % at room temperature, once in 2X SSC/0. 1 % 
SDS at ST'C, and once in 2X SSC/0. 1% SDS at 42 »C. Filters were exposed on 
Kodak X-Omat AR film at -80°C for 27 hours with an intensifying screen. 

Four plaques giving positive signals on duplicate lifts were 
restreaked on LB medium with magnesium (LBM)/carbenicillin (100 mg/ml) 
plates and incubated overnight at 37 °C. The phage plaques were lifted with 
Hybond filters (Amersham), probed as in the initial screen, and exposed on Kodak 
X-Omat AR film for 24 hours at -80 "C with an intensifying screen. 

Twelve plaques giving positive signals were transferred into low 
Mg++ phage diluent containing 10 mM Tris-HCl and 1 mM MgClj. Insert size 
was determined by PGR amplification using T3 and T7 primers (SEQ ID NOS: 
13 and 14, respectively) and the following reaction conditions. Samples were 
incubated at 94 ''C for 4 minutes and subjected to 30 cycles of the temperature 
step sequence: 94*C, for 15 seconds; SCC, for 30 seconds; and 72*C for 1 
minute. 

Six samples produced distinct bands that ranged in size from 300 
bases to 1 kb. Phagemids were released via co-infection with helper phage and 
recircularized to generate Bluescript SK" (Stratagene). The resulting colonies 
were cultured in LBM/carbenicillin (100 mg/ml) overnight. DNA was isolated 
with a Promega Wizard miniprep kit (Madison, WI) according to manufacturer's 
suggested protocol. EcoRI restriction analysis of purified DNA confirmed the 
molecular weights which were detected using PGR. Insert DNA was sequenced 
with M13 and M13 reverse. I primers set out in SEQ ID NOS: 40 and 41, 
respectively. 



5 '-TGTAAAACGACGGCCAGT-3 ' 
5 '-GGAAACAGCTATGACCATG-3 ' 



(SEQ ID NO: 40) 
(SEQ ID NO: 41) 
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Sequencing was performed as described in Example 4, 

Of the six clones, only r^vo, designated 10.3-1 and 10.5-2, provided 
sequence information and were identical 600 bp fragments. The 600 bp sequence 
was 68% identical to a corresponding region of human a^, 40% identical to 
human CD 11a, 58% identical to human CDllc, and 54% identical to mouse 
CD lib. This 600 bp fragment was then utilized to isolate a more complete 
cDNA encoding a putative mouse homolog , 

A mouse splenic cDNA library (oligo dT and random-primed) in 
lambda Zap II (Straugene) was plated at 2,5 x 10^ phage/15 cm LBM plate. 
Plaques were hfied on Hybond nylon transfer membranes (Amersham), denatured 
with 0.5 M NaOH/1.5 M NaCl, neutralized with 0.5 M Tris Base/1.5 M 
NaCl/1 L6 HQ, and washed in 2X SSC. The DNA was cross-linked to filters by 
ultraviolet irradiation. 

Approximately 500,000 plaques were screened using probes 10.3-1 
and 10.5-2 previously labeled as described supra. Probes were added to a 
prehybridization solution and incubated overnight at 50°C, The filters were 
washed twice in 2X SSC/0.1% SDS at room temperature, once in 2X SSC/0.1% 
SDS at ST^'C, and once in 2X SSC/0. 1 % SDS at 42'*C. Filters were exposed on 
Kodak X-Omat AR film for 24 hours at -SC'C with an intensifying screen. 
Fourteen plaques giving positive signals on duplicate lifts were subjected to a 
secondary screen identical to that for the initial screen except for additional final 
high stringency washes in 2X SSC/ 0.1% SDS at 50^*0, in 0.5X SSC/0.1% SDS 
at 50^*0, and at 55**C in 0.2X SSC/0, 1% SDS. The filters were exposed on 
Kodak X-Omal AR film at -80**C for 13 hours with an intensifying screen. 

Eighteen positive plaques were transferred into low Mg"^"*" phage 
diluent and insert size determined by PCR amplification as described above. 
Seven of the samples gave single bands that ranged in size from 600 bp to 4 kb. 
EcoRL restriction analysis of purified DNA confirmed the sizes observed from 
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PGR and the DNA w£is sequenced with primers M13 and M13 reverse. 1 (SEQ ID 
NOS: 40 and 41, respectively). 

One clone designated B3800 contained a 4 kb insert which 
corresponded to a region 200 bases downstream of the 5 ' end of the human 
19A2 clone and includes 553 bases of a 3 ' untranslated region. Clone B3800 
showed 77% identity to a corresponding region of human a^, 44% identity to a 
corresponding region of human GDI la, 59% identity to a corresponding region 
of human CDllc, and 51% identity to a corresponding region of mouse CDllb. 
The second clone A1160 was a 1.2 kb insert which aligned to the 5 ' end of the 
coding region of human approximately 12 nucleic acids downstream of the 
initiating methionine. Clone A1160 showed 75% identity to a corresponding 
region of human orj, 46% identity to a corresjxsnding region of human GDI la, 
62% identity to a corresponding region of human CDllc, and 66% identity to a 
corresponding region of mouse GDI lb. 

Clone Al 160, the fragment closer to the 5 ' end of human clone 
19A2, is 1160 bases in length, and shares a region of overlap with clone B3800 
starting at base 205 and continuing to base 1134. Clone A1160 has a 110-base 
insertion (bases 704-814 of clone A1160) not present in the overlapping region of 
clone B3800. This insertion occurs at a probable exon-intron boundary [Fleming, 
et al., J.Immunol. 750:480-490 (1993)] and was removed before subsequent 
ligation of clones A1160 and B3800. 

Rapid Amplification of 5 ' cDNA End of the Putative Mn»5;ft /vj rinnp 

RAGE PGR [Frohman, "RAGE: Rapid Amplification of cDNA 
Ends," in PCR Protocols: A Guide to Methods an d Annlicatinn<: innis, et al. 
(eds.) pp. 28-38, Academic Press:New York (1990)] was used to obtain missing 
5 ' sequences of the putative mouse clone, including 5 ' untranslated sequence 
and initiating methionine.' A mouse splenic RAGE-Ready kit (Clontech, Palo 
Alto, CA) was used according to the manufacturer's suggested protocol. Two 
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antisense, gene-specific primers, A1160 RACEl-primary and A1160 RACE2- 
nested (SEQ ID NOS: 42 and 43), were designed to perform primary and nested 
PGR. 

5 -GGACATGTrCACTGCCTCTAGG-3 ' (SEQ ID NO: 42) 

5 5 '.GGCGGACAGTCAGACGACTGTCCTG-3 ' (SEQ ID NO: 43) 

The primers, SEQ ID NOS: 42 and 43, correspond to regions 
starting 302 and 247 bases from the 5 ' end, respectively, PGR was performed 
as described, supra, using the 5 ' anchor primer (SEQ ID NO: 44) and mouse 
spleen cDNA supplied with the kit, 

10 5 '-CTGGTTCGGCCGAGGTGTGAAGGTTGGAGAATGGAT^SB^ID NO: 44) 

Electrophoresis of the PGR product revealed a band approximately 280 bases in 
size, which was subcloned using a TA cloning kit (Invitrogen) according to 
manufacturer's suggested protocol. Ten resulting colonies were cultured, and the 
DNA isolated and sequenced. An additional 60 bases of 5' sequence were 
15 identified by this method, which correspond to bases 1 to 60 in SEQ ID NO: 45. 

Characteristics of the Mouse cDNA and Predicted Amino Acid Sequence 

A composite sequence of the mouse cDNA encoding a putative 
homolog of human ol^ is set out in SEQ ID NO: 45. Although homology between 
the external domains of the human and mouse clones is high, homology between 
20 the cytoplasmic domains is only 30%. The observed variation may indicate C- 
terminal functional differences between the human and mouse proteins. 
Alternatively, the variation in the cytoplasmic domains may result from splice 
variation, or may indicate the existence of an additional ^2 i^^tegrin gene(s). 
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At the amino acid level, the mouse cDNA predicts a protein (SEQ 
ID NO: 46) with 28% identity to mouse CDlla, 53% identity to mouse CDllb, 
28% identity to human CDlla, 55% identity to human CDllb, 59% identity to 
human CDllc, and 70% identity to human a^. Comparison of the amino acid 
sequences of the cytoplasmic domains of human and the putative mouse 
homolog indicates regions of the same length, but having divergent primary 
structure. Similar sequence length in these regions suggests species variation 
rather than splice variant forms. When compared to the predicted rat polypq)tide, 
Example 16. supra, mouse and rat cytoplasmic domains show greater than 60% 
identity. 



Example 20 

Isolation of additional mouse r PNA clones for seg uen re verificatinn 

In order to verify tiie nucleic and amino acids sequences describe 
in Example 19 for mouse a^, additional mouse sequences were isolated for the 
purposes of confirmation. 

Isolation of mouse cDNA by hybridization with two homologous 
ttd probes (3 ' and 5 ') was performed using botii a mouse splenic random primed 
library and an oligo dT-primed cDNA library in lambda ZAE II (Strategene). 
The library was plated at 5 x 10^ phage per 15 cm LBM plate. Plaques were 
lifted on Hybond nylon membranes (Amersham), and tiie membranes were 
denatured (0.5 M NaOH/1.5 M NaCl), neutralized (0.5 M Tris Base/1.5 M NaCl 
/ 11.6 M HCl) and washed (2X SSC salt solution). DNA was cross-lined to 
filters by ultraviolet irradiation. 

Probes were generated using primers described below in a PCR 
reaction under the following conditions. Samples were held at 94 "C for 4 
minutes and then run tiirough 30 cycles of the temperature step sequence (94*C 
for 15 seconds; 50°C for 30 seconds; 72»C for 1 minute in a Perkin-Elmer 9600 
thermocycler). 
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The 3 ' probe was approximately 900 bases long and spanned a 
region from nucleotides 2752 to 3651 (in SEQ ID NO: 1) (5' ^ 3') and was 
produced with primers ll.b-l/2FORll and lLb-l/2REV2 as shown in SEQ ID 
NOS: 69 and 74, respectively. This probe was used in a first set of lifts. 
5 The 5 ' probe was approximately 800 bases long and spanned a 

region from nucleotides 149 to 946 (in SEQ ID NO: 1) (5 ^ 3 ^ and was 
produced with primers ll,b-l/2FORl and ll.a-1/lREVl as shown in SEQ ID 
NOS: 50 and 85, respectively). This probe was used in a second set of lifts. 

In a third set of lifts, both probes described above were used 
10 together on the same plates. 

Approximately 500,000 plaques were screened using the two probes 
from above which were labeled in the same way as described in Example 17. 
Labeled probes were added to a prehybridization solution, containing 45% 
formamide, and incubated overnight at SO^'C. Filters were washed twice in 2X 
15 SSC/0. 1 % SDS at room temperature (22'*C). A final wash was carried out in 2X 

SSC/O.1% SDS at 50''C. Autoradiography was for 19 hours at -SO^'C on Kodak 
X-Omat AR film with an intensifying screen. 

Thirteen plaques giving positive signals on at least duplicate lifts 
were subjected to a secondary screen performed as described for the initial screen 
20 except that both the 3 ' and 5 ' labeled probes were used for hybridization and an 

additional final wash was incorporated using 2X SSC/O.1% SDS at 65 X. 
Autoradiography was performed as described above for 2.5 hours. 

Thirteen plaques (designated MS2P1 through MS2P13) giving 
positive signals were transferred into low Mg phage diluent. Insert size was 
25 determined by PGR amplification (Perkin-Elmer 9600 thermocycler) using T3 and 

T7 primers which anneal to Bluescript phagemid in ZAP II (sequence previously 
described) under the same conditions shown above. Band sizes ranged from 500 
bases to 4Kb. Phagemids were isolated, prepared, and sequenced with M13 and 
M13 reverse. 1 primers (SEQ ID NOS: 40 and 41, respectively). Five of the 
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thirteen clones; MS2P-3, MS2P-6, MS2P-9, MS2P-12, and MS2P-13, were 
sequenced, and together, represented a region from approximately base 200 at the 
5 ' end to about 300 bases past a first stop codon at the 3 ' end. 

Automated sequencing was performed as described in Example 4 
by first using M13 and M13 reverse.! primers (SEQ ID NOS: 40 and 41, 
respectively) to sequence the ends of each clone and to determine its position 
relative to construct #17 (SEQ ID NO: 45). Each clone was then completely 
sequenced using the appropriate primers (listed below) for that particular region. 



1 1 .b- 1/2FOR1 5 '-GCAGCCAGCTTCGGACAGAC-3 ' (SEQ ID NO: 50) 

10 ll.a-l/lFOR2 5 '-CCGCCTGCCACTGGCGTGTGC-3 ' (SEQ ID NO: 60) 

1 1 .a-l/lFOR3 5 '-CCCAGATGAAGGACTTCGTCAA^S ' (SEQ ID NO: 61) 

1 1 .b- 1/2FOR4 5 '-GCTGGGATCATTCGCTATGC-3 ' (SEQ ID NO: 62) 

1 1 .b- 1/2FOR5 5 '-CAATGG ATGGACCAGTTCTGG-3 ' (SEQ ID NO: 63) 

1 1 .b-l/2FOR6 5 '-CAGATCGGCTCCTACnTGG-3 ' (SEQ ID NO: 64) 

15 ll.b-l/2FOR7 5 '-CATGGAGCCTCGAGACAGG-3 ' (SEQ ID NO: 65) 

ll.b-l/2FOR8 5 '-CCACTGTCCTCGAAGCTGGAG-3 ' (SEQ ID NO: 66) 

ll.b-l/2FOR9 5 '-CTTCGTCCTGTGCTGGCTGTGGGCTC-3 

(SEQ ID NO: 67) 

1 1 .b-l/2FOR10 5 '-CGCCTGGCATGTGAGGCTGAG-3 ' (SEQ ID NO: 68) 

I l.b-l/2FORl 1 5 '-CCGTGATCAGTAGGCAGGAAG-3 ' (SEQ ID NO: 69) 
ll.b-l/2FOR12 5 -GTCACAGAGGGAACCTCC-3' (SEQ ID NO: 70) 
ll.b-l/2FOR13 5'-GCTCCTGAGTGAGGCTGAAATCA-3'(SEQIDNO: 71) 
ll.b-l/2FOR14 5'-GAGATGCTGGATCTACCATCTGC-3'(SEQIDNO: 72) 
ll.b-l/2FOR15 5 '-CTGAGCTGGGAGATmTATGG-3 ' (SEQ ID NO: 73) 

25 ll.b-l/2REV2 5 '-GTGGATCAGCACTGAAATCTG-3 ' (SEQ ID NO: 74) 

I I .b-l/2REV3 5 -CGTITGAAGAAGCCAAGCTTG-S ' (SEQ ID NO: 75) 
1 1 .b-l/2REV4 5 '-CACAGCGGAGGTGCAGGCAG-3 ' (SEQ ID NO: 76) 
ll.b-l/2REV5 5 '-CTCACTGCTTGCGCTGGC-3 ' (SEQ ID NO: 77) 



20 
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1 1 .b- 1/2REV6 5 '-CGGTAAGATAGCTCTGCTGG-3 ' (SEQ ID NO: 78) 

ll.b-l/2KEV7 5 '-GAGCCCACAGCCAGCACAGG-3 ' (SEQ ID NO: 79) 

ll.b-l/2REV8 5'-GATCCAACGCCAGATCATACC-3' (SEQ ID NO: 80) 

ll.b-l/2KEV9 5 '-CACGGCCAGGTCCACCAGGC-3 ' (SEQ ID NO: 81) 

5 il.b-l/2EEV10 5'-CACGTCCCCTAGCACTGTCAG-3' (SEQ ID NO: 82) 

ll.b-l/2REVll 5 '-CCATGTCCACAGAACAGAGAG-3 ' (SEQ ID NO: 51) 

ll.b-l/2REV12 5 '-TTGACGAAGTCCTTCATCTGGG-S ' (SEQ ID NO: 83) 

1 1 .b-l/2REV13 5 '-GAACTGCAAGCTGGAGCCCAG-3 ' (SEQ ID NO: 84) 

1 l.a-1/lREVl 5 '-CTGGATGCTGCGAAGTGCTAC-3 ' (SEQ ID NO: 85) 

10 Il.a-1/1REV2 5'-GCCTTGGAGCTGGACGATGGC-3' (SEQ ID NO: 86) 



Sequences were edited, aligned, and compared to a previously isolated mouse 
sequence (construct #17, SEQ ID NO: 45). 

Alignment of the new sequences revealed an 18 base deletion in 
construct #17 beginning at nucleotide 2308; the deletion did not cause a shift in 

15 the reading ftame. Clone MS2P-9, sequenced as described above, also revealed 
the same 18 base deletion. The deletion has been observed to occur in 50% of 
mouse clones that include the region but has not been detected in rat or human 
clones. The eighteen base deletion is characterized by a 12 base palindromic 
sequence AAGCAGGAGCTCCTGTGT (SEQ ID NO: 91). This inverted repeat 

20 in the nucleic acid sequence is self-complementary and may form a loop out, 

causing cleavage during reverse transcription. The mouse sequence which 
includes the additional 18 bases is set forth in SEQ ID NO: 52; the deduced 
amino acid sequence is set forth in SEQ ID NO: 53. 
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Example 21 

In situ hybridizations in Mouse 

Tissue distribution was then determined for mouse in order to 
provide a comparison to that in humans, described in Example 6. 
5 A single stranded 200 bp mRNA probe was generated from a DNA 

template, corresponding to nucleotides 3460 to 3707 in the cytoplasmic tail region 
of the murine cDNA, by in vitro RNA transcription incorporating ^^S-UTP 
(Amersham), 

Whole mouse embryos (harvested at days 11-18 after fertilization) 
10 and various mouse tissues, including spleen, kidney, liver, intestine, and thymus, 
were hybridized in situ with the radiolabeled single-stranded mRNA probe. 

Tissues were sectioned at 6 ^tm thickness, adhered to Vectabond 
(Vector Laboratories, Inc., Burlingame, CA) coated slides, and stored at -70^*0. 
Prior to use, slides were removed from -70**C and placed at 50**C for 
15 approximately 5 minutes. Sections were fixed in 4% paraformaldehyde for 20 

minutes at 4^*0, dehydrated with an increasing ethanol gradient (70-95-100%) for 

1 minute at 4*'C at each concentration, and air dried for 30 minutes at room 
temperature. Sections were denatured for 2 minutes at 70 "^C in 70% 
formamide/2X SSC, rinsed twice in 2X SSC, dehydrated with the ethanol gradient 

20 described supra and air dried for 30 minutes. Hybridization was carried out 

overnight (12-16 hours) at SS^'C in a solution containing ^^S-labeled riboprobes 
at 6 X 10^ cpm/section and diethylpyrocarbonate (DEPC)-treated water to give a 
final concentration of 50% formamide, 03 M NaCl, 20 mM Tris-HCl, pH 7.5, 
10% dextran sulfate, IX Denhardt's solution, 100 mM dithiothreitol (DTT) and 

25 5 mM EDTA. After hybridization, sections were washed for 1 hour at room 

temperature in 4X SSC/10 mM DTT, 40 minutes at 60^*0 in 50% formamide/2X 
SSC/10 mM DTT, 30 minutes at room temperature in 2X SSC, and 30 minutes 
at room temperature in O.IX SSC. The sections were dehydrated, air dried for 

2 hours, coated with Kodak NTB2 photographic emulsion, air dried for 2 hours. 
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developed (after storage at 4''C in complete darkness) and counterstained with 
hematoxylin/eosin. 

Spleen tissue showed a strong signal primarily in the red pulp. This 
pattern is consistent with that of tissue macrophage distribution in the spleen, but 
5 does not exclude other cell types. 

Example 22 

Generation of Mouse Expression Constructs 

In order to construct an expression plasmid including mouse cDNA 
sequences exhibiting homology to human a^, inserts from clones A1160 and 

10 B3800 were ligated. Prior to this ligation, however, a 5' leader sequence, 

including an initiating methionine, was added to clone A1160. A primer 
designated "5 ' PGR leader" (SEQ ID NO: 47) was designed to contain: (1) 
identical nonspecific bases at positions 1-6 allowing for digestion; (2) a BamHl 
site (underlined in SEQ ID NO: 47) from positions 7-12 to facilitate subcloning 

15 into an expression vector; (3) a consensus Kozak sequence from i>ositions 13-18, 

(4) a signal sequence including a codon for an initiating methionine (bold in SEQ 
ID NO: 47), and (5) an additional 31 bases of specifically overlapping 5' 
sequence from clone A1160 to allow primer annealing. A second primer 
designated "3' end frag" (SEQ ID NO: 48) was used with primer "5' PGR 

20 leader" to amplify the insert from clone A1160. 

5 '-AGTTACGGATCCGGCACCATGAC- 

-CTTCGGCACTGTGATCCTCCTGTGTG-3 ' (SEQ ID NO: 47) 

5 '-GCTGGACGATGGCATCCAC-3 ' (SEQ ID NO: 48) 

The resulting PGR product did not digest with BamBl, suggesting 
25 that an insufficient number of bases preceded the restriction site, prohibiting 
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recognition by the enzyme. The length of the "tail" sequence preceding the 
BamUl site in the 5 ' primer (SEQ ID NO: 47) was increased and PGR was 
repeated on the amplification product from the first PGR, A 5' primer, 
designated mAD.5 \2 (SEQ ID NO: 49), was designed with additional nonspecific 
bases at positions 1-4 and an additional 20 bases specifically overlapping the 
previously employed "5 ' PGR leader" primer sequences. 

5 '-GTAGAGTTAGGGATCCGGCAGGAT-3 ' (SEQ ID NO: 49) 

Primers "mAD.5 \2" and "3 ' end frag" were used together in PGR 
with the product from the first amplification as template. A resulting secondary 
PGR product was subcloned into plasmid pGRtmll (Invitrogen) according to 
manufacturer's suggested protocol and transformed into competent One shot cells 
(Invitrogen). One clone containing the PGR product was identified by restriction 
enzyme analysis using BamHl and £toRI and sequenced. After the sequence was 
verified, the insert was isolated by digestion with BaniHI and EcoRI and gel 
purified. 

The insert from clone B3800 was isolated by digestion with £coRI 
and Notl, gel purified, and added to a ligation reaction which included the 
augmented Al 160 BamBI/EcoRl fragment. Ligation was allowed to proceed for 
14 hours at 14°C. Vector pcDNA.3 (Invitrogen), digested with BamHl and Notl, 
was added to the ligation reaction with additional ligase and the reaction was 
continued for another 12 hours. An aliquot of the reaction mixture was 
transformed into competent E. coli cells, the resulting colonies cultured, and one 
positive clone identified by PGR analysis with the primers ll.b-l/2FORl and 
ll.b-l/2REVll (SEQ ID NOS: 50 and 51, respectively). These primers bridge 
the A1160 and B3800 fragments, therefore detection of an amplification product 
indicates the two fragments were ligated. The sequence of the positive clone was 
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thymidine kinase encoding cassettes. Further analysis of this clone with an I 
domain probe (corresponding to nucleotides 454-1064 in SEQ ID NO: 45) 
indicated that the clone did not contain I domain encoding sequences. 

Using the same I domain probe, the XFKH genomic library was 
rescreened. Initially, six positive clones were detected, one of which remained 
positive upon secondary screening. DNA isolated from this clone reacted strongly 
in Southern analysis with an I domain probe. No reactivity was detected using 
the origmal 750 bp probe, however, indicating that this clone included regions 5 ' 
to nucleotides 1985-2773 of SEQ ID NO: 45.. 

Alternatively, the lack of hybridization to the 750 bp probe may 
have suggested that the clone was another member of the integrin family of 
proteins. To determine if this explanation was plausible, the 13 kb insert was 
subcloned into pBluescript SKH"*", Purified DNA was sequenced using primers 
corresponding to I domain nucleic acid sequences 441-461, 591-612, 717-73^, 
and reverse 898-918 in SEQ ID NO: 52. Sequence information was obtained 
using only the first 4441-4461 primer, and only the 5 '-most exon of the I domain 
was efficiently amplified. The remainder of the I domain was not amplified. The 
resulting clone therefore comprised exon 6 of the mouse gene, and intronic 
sequences to the 3 ' and 5 ' end of the exon. Exon 7 was not represented in the 
clone. After sequencing, a construct is generated containing neomycin resistance 
and thymidine kinase genes. 

The neomycin resistance (neoO gene is inserted into the resulting 
plasmid in a manner that interrupts the protein coding sequence of the genomic 
mouse DNA. The resulting plasmid therefore contains a neo^ gene within the 
mouse genomic DNA sequences, all of which are positioned within a thymidine 
kinase encoding region. Plasmid construction in this manner is required to favor 
homologous recombination over random recombination [Chisaka, et ah , Nature 
555:516-520 (1992)]. 
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verified with the primers set out in SEQ ID NOS: 50 and 51, which amplify from 
base 100 to 1405 after the initiating methionine. 

Example 23 

Construction of a Knock-out Mouse 

In order to more accurately assess the immunological role of the 
protein encoded by the putative mouse cDNA, a "knock-out" mouse is 
designed wherein the genomic DNA sequence encoding the putative homolog 
is disrupted by homologous recombination. The significance of the protein 
encoded by the disrupted gene is thereby assessed by the absence of the encoded 
protein. Generation of "knock-out" mice is described in Deng, et aL, 
MoLCell.Biol 75:2134-2140 (1993), 

Design of such a mouse begins with construction of a plasmid 
containing sequences to be "knocked out" by homologous recombination events. 
A 750 base pair fragment of the mouse cDNA (corresponding to nucleotides 1985 
to 2733 in SEQ ID NO: 45) was used to identify a mouse genomic sequence 
encoding the putative mouse ot^ homolog from a XFIXII genomic library. 
Primary screening resulted in 14 positive plaques, seven of which were confirmed 
by secondary screening. Liquid lysates were obtained from two of the plaques 
giving the strongest signal and the X DNA was isolated by conventional methods. 
Restriction mapping and Southern analysis confirmed the authenticity of one 
clone, designated 14-1, and the insert DNA was isolated by digestion with Notl, 
This fragment was cloned into Bluescript SKII"^. 

In order to identify a restriction fragment of approximately 9 to 14 
kb, a length reported to optimize the probability of homologous recombination 
events, Southern hybridization was performed with the 750 bp cDNA probe. 
Prior to hybridization, a restriction map was constructed for clone 14-1. A 12 kb 
fragment was identified as a possible candidate and this fragment was subcloned 
into pBluescript SKII'*' in a position wherein the mouse DNA is flanked by 
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10^ dpm/ml. Hybridization was carried out at 42''C for 16-18 hours. Filters 
were washed extensiveiy in 2X SSPE/0. 1 % SDS at room temperature and exposed 
to X-ray film to visualize any hybridizing plaques. 

Two clones with significant sequence homology to human were 
5 identified- Clone #2 was approximately 800 bp in length and mapped to the 5 ' 

end of human a^j. Clone #2 includes an initiating methionine and complete leader 
sequence. Clone #7 was approximately 1.5 kb and includes an initiating 
methionine. The 5 ' end of clone #7 overlapped that of clone #2, while the 3 " 
sequences terminated at a point beyond the I domain sequences. Internal 
10 sequencing of clone #7 is performed using the nested deletions sequencing 

technique. 

The predicted N terminal amino acid sequence for rabbit of^ as 
determined from clones #2 and #7 indicated a protein with 73% identity with 
human a^j , 65% identity with mouse a^j, and 58% identity with mouse CDllb, 
15 human CDllb, and human CDllc. The nucleic acid sequence for clone #2 is set 

out in SEQ ID NO: 92; the predicted amino acid sequence is set out in SEQ ID 
NO: 93 

Isolation of a full length rabbit cDNA is carried out using 
labeled rabbit fragment, clone # 7, and rescreening the cDNA library from which 
20 the fragment was derived. 

Isolation of a rabbit clone allows expression of the protein, 
either on the surface of transfectants or as a soluble full length or truncated form. 
This protein is then used as an immunogen for the production of monoclonal 
antibodies for use in rabbit models of human disease states. 
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Example 24 

Cloning of Rabbit ot^ - Construction and Screening of the Rabbit cDNA Library 
Identification of human homologs in rats and mice led to the 
investigation of the existence of a rabbit homolog which would be useful in rabbit 
5 models of human disease states described ir^-a. 

Poly A"^ RNA was prepared from a whole rabbit spleen using an 
Invitrogen FastTrack kit (San Diego, CA) according to manufacturer's suggested 
protocol and reagents supplied with the kit. From 1.65 g tissue, 73 fig poly 
A"*'RNA were isolated. The rabbit spleen RNA was used to construct a ZAP 
10 Express cDNA library using a kit from Stratagene (La JoUa, CA), Resulting 

cDNA was directionally cloned into EcdSI and Xhol sites in the lambda arms of 
a pBK-CMV phagemid vector. Gigapack n Gold (Stratagene) was used to 
package the lambda arms into phage particles. The resulting library titer was 
estimated to be approximately 8 x 10^ particles, with an average insert size of 1.2 
15 kb. 

The library was amplified once by plating for confluent plaque 
growth and cell lysate was collected. The amplified library was plated at 
approximately 30,(XX) plaque forming units (pfii) per 150 mm plate with E. coli 
and the resulting mixture incubated for 12-16 hrs at 37oC to allow plaque 

20 formation. Phage DNA was transferred onto Hybond N*^ nylon membranes 

(Amersham, Arlington Heights, Illinois). The membranes were hybridized with 
a mixture of two random primed radiolabeled mouse ot^ PGR DNA probes. The 
first probe was generated from a PGR product spanning nucleotides 149-946 in 
SEQ ID NO: 52. The second probe was from a PGR product spanning 

25 nucleotides 2752-3651 in SEQ ID NO: 52. Probes were labeled by random 

priming (Boehringer Mannheim Random Primed DNA Labeling Kit) and the 
reaction mixture was passed over a Sephadex G-50 column to remove 
unincorporated nucleotides. The hybridization solution was composed of 5X 
SSPE, 5X Denhardts, 1% SDS, 40% Formamide and the labeled probes at 1 x 
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functions which damage normal host tissue through either specific autoimmune 
responses or as a result of bystander cell damage. 

Disease states in which there is evidence of macrophages playing 
a significant role in the disease process include multiple sclerosis, arthritis, graft 
atherosclerosis, some forms of diabetes and infiammatory bowel disease. Animal 
models, discussed below, have been shown to reproduce many of the aspects of 
these human disorders. Inhibitors of aj function are tested in these model 
systems to determine if tiie potential exists for treating the corresponding human 
diseases. 

A. Graft Arteriosclerosis 

Cardiac transplantation is now the accepted form of therapeutic 
intervention for some types of end-state heart disease. As the use of cyclosporin 
A has increased one year survival rates to 80%, the development of progressive 
graft arteriosclerosis has emerged as tiie leading cause of deatii in cardiac 
transplants surviving beyond tiie first year. Recent studies have found tiiat the 
incidence of significant graft arteriosclerosis 3 years following a cardiac transplant 
is in the range of 36-44% [Adams, et al. Transplantation 53:1115-1119 (1992); 
Adams, et ah. Transplantation 55:794-799 (1993)]. 

Graft arteriosclerosis typically consists of diffuse, occlusive, intimal 
lesions which affect the entire coronary vessel wall, and are often accompanied 
by lipid deposition. While the patiiogenesis of graft arteriosclerosis remains 
unknown, it is presumably linked to histocompatibility differences between donor 
and recipient, and is immunologic in nature. Histologically, the areas of intimal 
thickening are composed primarily of macrophages, although T cells are 
occasionally seen. It is therefore possible that macrophages expressing may 
play a significant role in tfie induction and/or development of graft 
arteriosclerosis. In such a case, monoclonal antibodies or small molecule 
inhibitors (for example, soluble ICAM-R) of function could be given 
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Evamplp 2*> 

Animal Models For Determining Therapeutic ITtilify 

Immunohistologic data in dog and in situ hybridization in rats and 
mice has determined that in spleen is expressed primarily by macrophages 
present in red pulp and in lymph nodes, is found in meduUary cords and 
sinuses. The expression pattern is remarkably similar to what has been rqrorted 
for two murine antigens defined by the monoclonal antibodies F4/80 and SK39. 
While biochemical characterization of these murine antigens has demonstrated that 
they are distinct from a^, it is highly probably that defines the same 
macrophage subset as the murine F4/80 and SK39 antigens. 

In mouse, SK39-positive macrophages have been identified in 
splenic red pulp where they may participate in the clearance of foreign materials 
from circulation, and in medulla of lymph nodes [Jutila, et al, J. Leukocyte Biol. 
54:30-39 (1993)J. SK39-positive macrophages have also been reported at sites of 
both acute and chronic inflammation. Furthermore, monocytes recruited to 
thioglycolate-innamed peritoneal cavities also express the SK39 antigen. 
Collectively, these findings suggest that, if SK39"^ cells are also a^^ , then these 
cells are responsible for the clearance of foreign materials in the spleen and 
participate in inflammation where macrophages play a significant role. 

While the function of remains unclear, other more well 
characterized &2 integrins have been shown to participate in a wide variety of 
adhesion events that facilitate cell migration, enhance phagocytosis, and promote 
cell-cell interactions, events which all lead to upregulation of inflammatory 
processes. Therefore, it is highly plausible that interfering with the normal 
function may also interfere with inflammation where macrophages play a 
significant role. Such an anti-inflammatory effect could result fi-om: i) blocking 
macrophage recruitment to sites of inflammation, ii) preventing macrophage 
activation at the site of inflammation or iii) interfering witii macrophage effector 
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lumenal surface of the ascending aorta [Rosenfeld, et ah. Arteriosclerosis 7:9-23 
(1987); Rosenfeld, et al, Artenoscierosis 7:24-34 (1987)]. The atherosclerotic 
lesions seen in these rabbits are simmer to those in humans. Lesions contain 
large numbers of T cells, most of which express CD45RO, a marker associated 
5 with memory T cells. Approximately half of the infiltrating T cells also express 
MHC class n antigen and some express the IL-2 receptor suggesting that many 
of the cells are in an activated state. 

One feature of the atherosclerotic lesions found in cholesterol fed 
rabbits, but apparendy absent in rodent models, is the accumulation of foam cell- 
10 rich lesions. Foam cell macrophages are believed to result from the uptake of 
oxidized low-density lipoprotein (LDL) by specific receptors. Oxidized LDL 
particles have been found to be toxic for some cell types including endothelial 
cells and smooth muscle cells. The uptake of potentially toxic, oxidized LDL 
particles by macrophages serves as an irritant and drives macrophage activation, 
15 contributing to the inflammation associated with atherosclerotic lesions. 

Once monoclonal antibodies have been generated to rabbit a^, 
cholesterol fed rabbits are treated. Treatments include prophylactic administration 
of monoclonal antibodies or small molecule inhibitors, to demonstrate that 
Qrj+ macrophages are involved in the disease process. Additional studies would 
20 demonstrate that monoclonal antibodies to or small molecule inhibitors are 
capable of reversing vessel damage detected in rabbits fed an atherogenic diet. 

C. Insulin-dependent Diabetes 

BB rats spontaneously develop insulin-dependent diabetes at 70-150 
days of age. Using immunohistochemistry, MHC class II+, EDl"^ macrophages 
25 can be detected infiltrating the islets early in the disease. Many of the 
macrophages appear to be engaged in phagocytosis of cell debris or normal ceUs. 
As the disease progresses, larger numbers of macrophages are found infiltrating 
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prophylactically to individuals who received heart transplants and are at risk of 
developing graft arteriosclerosis. 

Although atherosclerosis in heart transplants presents the greatest 
threat to life, graft arteriosclerosis is also seen in other solid organ transplants, 
including kidneys and livers. Therapeutic use of blocking agents could prevent 
graft arteriosclerosis in other organ transplants and reduce complications resulting 
from graft failure. 

One model for graft arteriosclerosis in the rat involves heterotopic 
cardiac allografts transplanted across minor histocompatibility barriers. When 
Lewis cardiac allografts are transplanted into MHC class I and II compatible F- 
344 recipients, 80% of the allografts survive at least 3 weeks, while 25% of the 
grafts survive indefinitely. During this low-grade graft rejection, arteriosclerosis 
lesions form in the donor heart. Arterial lesions in 120 day old allografts 
typically have diffuse fibrotic intimal thickening indistinguishable in appearance 
from graft arteriosclerosis lesions found in rejecting human cardiac allografts. 

Rats are transplanted with hearts mismatched at minor 
histocompatibility antigens, for example Lewis into F-344. Monoclonal antibodies 
specific for rat or small molecule inhibitors of are given periodically to 
transplant recipients. Treatment is expected to reduce the incidence of graft 
arteriosclerosis in non-rejecting donor hearts. Treatment of rats with 
monoclonal antibodies or small molecule inhibitors may not be limited to 
prophylactic treatments. Blocking function is also be expected to reduce 
macrophage mediated inflammation and allow reversal of arterial damage in the 
graft. 

B. Atherosclerosis in Rabbits Fed Cholesterol 
Rabbits fed an atherogenic diet containing a cholesterol supplement 
for approximately 12-16 weeks develop intimal lesions that cover most of the 
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the islets, although significant numbers of T cells, and later B cells, also appear 
to be recruited to the site [Hanenfaerg, et ah, Diabetologia 52:126-134 (1989)]. 

Development of diabetes in BB rats appears to depend on both early 
macrophage infiltration and subsequent T cells recruitment. Treatment of BB rats 

5 with silica particles, which are toxic to macrophages, has been effective in 

blocking the early macrophage infiltration of the islets. In the absence of early 
macrophage infiltration, subsequent tissue damage by an autoaggressive 
lymphocyte population fails to occur. Administration of monoclonal antibody 
OX- 19 (specific for rat CDS) or monoclonal antibody OX-8 (specific for rat 

10 CD8), which block the T cell-associated phase of the disease, is also effective in 
suppressing the development of diabetes. 

The central role of macrophages in the pathology of this model 
makes it attractive for testing inhibitors of function. Rats genetically 
predisposed to the development of insulin-dependent diabetes are treated with 

15 monoclonal antibodies to or small molecule inhibitors and evaluated for the 

development of the disease. Preventing or delaying clinical onset is evidence that 
a J plays a pivotal role in macrophage damage to the islet cells. 

D. Inflammatory Bowel Disease (Crohn's Disease, Ulcerative 

Colitis) 

20 Animal models used in the study of inflammatory bowel disease 

(IBD) are generally elicited by intrarectal administration of noxious irritants (e.g. 
acetic acid or trinitrobenzene sulfonic acid/ethanol). Colonic inflammation 
induced by these agents is the result of chemical or metabolic injury and lacks the 
chronic and spontaneously relapsing inflammation associated with human IBD, 

25 However, a recently described model using subserosal injections of purified 
peptidoglycan-polysaccharide (PG-PS) polymers from either group A or group D 
streptococci appears to be a more physiologically relevant model for human IBD 
[Yamada, et al, Castroenterolgy 104:159-111 (1993)]. 
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In this model PG-PS is injected into the subserosal layer of the 
distal colon. The resulting inflammatory response is biphasic with an initial acute 
episode three days after injection, which is followed by a spontaneous chronic 
phase three to four weeks later. The late phase response is granulomatous in 
5 nature, and results in colonic thickening, adhesions, colonic nodules and mucosal 

lesions. In addition to mucosal injury, PG-PS colitis frequently leads to arthritis 
anemia and granulomatous hepatitis. The extraintestinal manifestations of the 
disease make the model attractive for studying Crohn's colitis in that a significant 
number of patients with active Crohn's disease suffer from arthritic joint disease 

10 and hepatobiliary inflammation. 

Granulomatous lesions are the result of chronic inflammation which 
leads to the recruitment and subsequent activation of cells of the 
monocyte/macrophage lineage. Presence of granulomatous lesions in Crohn's 
disease and the above animal model make this an attractive clinical target for 

15 monoclonal antibodies or other inhibitors of function. Inhibitors of 

function are expected to block the formation of lesions associated with IBD or 
even reverse tissue damage seen in the disease. 



E. Arthritis 

Arthritis appears to be a multi-factorial disease process involving 
20 a variety of inflammatory cell types including neutrophils, T lymphocytes and 
phagocytic macrophages. Although a variety of arthritis models exist, 
preparations of streptococcal cell wall proteoglycan produce a disorder most 
similar to the human disease. 

In rats, streptococcal cell wall induces inflammation of peripheral 
25 joints characterized by repeated episodes of disease progression followed by 

remission and eventually resulting in joint destruction over a period of several 
months [Cromartie, et al, J.Exp.Med. 745:1585-1602 (1977); Schwab et al. 
Infection and Immunity 5P:4436-4442 (1991)]. During the chronic phase of the 
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disease, mononuclear phagocytes or macrophages are believed to play a major 
role in destruction of the synovium. Furthermore, agents which suppress the 
recruitment of macrophages into the synovium effectively reduce the inflammation 
and pathology characteristic of arthritis. 

5 A central role for the macrophage in synovium destruction that 

leads to arthritis predicts that monoclonal antibodies to or inhibitors of 
function may have therapeutic potential in the treatment of this disease. As in 
other models previously described, monoclonal antibodies or small molecule 
inhibitors administered prophylactically are expected to block or moderate joint 

10 inflammation and prevent destruction of the synovium. Agents that interfere with 
function may also moderate ongoing inflammation by preventing the 
recruitment of additional macrophages to the joint or blocking macrophage 
activation. The net result would be to reverse ongoing destruction of the joint and 
facilitate tissue repair. 



15 F. Multiple Sclerosis 

Although pathogenesis of multiple sclerosis (MS) remains unclear, 
it is generally accepted that the disease is mediated by CD4'^ T cells which 
recognize autoantigens in the central nervous system and initiate an inflammatory 
cascade. The resulting immune response results in the recruitment of additional 

20 inflammatory cells, including activated macrophages which contribute to the 

disease. Experimental autoimmune encephalomyelitis (EAE) is an animal model 
which reproduces some aspects of MS. Recently, monoclonal antibodies reactive 
with CDllb/CD18 [Huitinga, et al, EurJJmmunoL 25:709-715 (1993)] present 
on inflammatory macrophages have been shown to block both chnical and 

25 histologic disease. The results suggest that monoclonal antibodies or small 

molecule inhibitors to are likely to be effective in blocking the inflammatory 
response in EAE. Such agents also have important therapeutic applications in the 
treatment of MS. 
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Numerous modifications and variations in the invention as set forth 
in the above illustrative examples are expected to occur to those skilled in the art. 
Consequently only such limitations as appear in the appended claims should be 
placed on the invention. 



V 
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( ix ) FEATURE : 

(A) NAME /KEY: CDS 

(B) LOCATION: 3.. 3485 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

TG ACC TTC GGC ACT GTG CTT CTT CTG AGT GTC CTG GCT TCT TAT CAT 47 
Thr Phe Gly Thr Val Leu Leu Leu Ser Val Leu Ala Ser Tyr His 
15 10 15 

GGA TTC AAC CTG GAT GTG GAG GAG CCT ACG ATC TTC CAG GAG GAT GCA 95 
Gly Phe Asn Leu Asp Val Glu Glu Pro Thr lie Phe Gin Glu Asp Ala 
20 25 30 

GGC GGC TTT GGG CAG AGC GTG GTG CAG TTC GGT GGA TCT CGA CTC GTG 143 
Gly Gly Phe Gly Gin Ser Val Val Gin Phe Gly Gly Ser Arg Leu Val 
35 40 45 

GTG GGA GCA CCC CTG GAG GTG GTG GCG GCC AAC CAG ACG GGA CGG CTG 191 
Val Gly Ala Pro Leu Glu Val Val Ala Ala Asn Gin Thr Gly Arg Leu 
50 55 60 

TAT GAC TGC GCA GCT GCC ACC GGC ATG TGC CAG CCC ATC CCG CTG CAC 239 
Tyr Asp Cys Ala Ala Ala Thr Gly Met Cys Gin Pro lie Pro Leu His 
65 70 75 

ATC CGC CCT GAG GCC GTG 7UVC ATG TCC TTG GGC CTG ACC CTG GCA GCC 287 
lie Arg Pro Glu Ala Val Asn Met: Ser Leu Gly Leu Thr Leu Ala Ala 
80 85 90 95 

TCC ACC AAC GGC TCC CGG CTC CTG GCC TGT GGC CCG ACC CTG CAC AGA 335 
Ser Thr Asn Gly Ser Arg Leu Leu Ala Cys Gly Pro Thr Leu His Arg 
100 105 110 

GTC TGT GGG GAG AAC TCA TAG TCA AAG GGT TCC TGC CTC CTG CTG GGC 383 
Val Cys Gly Glu Asn Ser Tyr Ser Lys Gly Ser Cys Leu Leu Leu Gly 
115 120 ^ 125 

TCG CGC TGG GAG ATC ATC CAG ACA GTC CCC GAC GCC ACG CCA GAG TGT 431 
Ser Arg Trp Glu He He Gin Thr Val Pro Asp Ala Thr Pro Glu Cys 
130 135 140 

CCA CAT CAA GAG ATG GAC ATC GTC TTC CTG ATT GAC GGC TCT GGA AGC 479 
Pro His Gin Glu Met Asp He Val Phe Leu He Asp Gly Ser Gly Ser 
145 150 155 

ATT GAC CAA AAT GAC TTT AAC CAG ATG AAG GGC TTT GTC CAA GCT GTC 527 
He Asp Gin Asn Asp Phe Asn Gin Met Lys Gly Phe Val Gin Ala Val 
160 165 170 175 

ATG GGC CAG TTT GAG GGC ACT GAC ACC CTG TTT GCA CTG ATG CAG TAC 575 
Met Gly Gin Phe Glu Gly Thr Asp Thr Leu Phe Ala Leu Met Gin Tyr 
180 185 190 

TCA AAC CTC CTG AAG ATC CAC TTC ACC TTC ACC CAA TTC CGG ACC AGC 623 
Ser Asn Leu Leu Lys He His Phe Thr Phe Thr Gin Phe Arg Thr Ser 
195 200 205 

CCG AGC CAG CAG AGC CTG GTG GAT CCC ATC GTC CAA CTG AAA GGC CTG 671 
Pro Ser Gin Gin Ser Leu Val Asp Pro He Val Gin Leu Lys Gly Leu 
210 215 220 
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ACG TTC ACG GCC AGO GGC ATC CTG ACA GTG GTG ACA GAG OTA TTT CAT 719 
Thr Phe Thr Ala Thr Gly lie Leu Thr Val Val Thr Gin Leu Phe His 
225 230 235 

CAT AAG AAT GGG GCC CGA AAA AGT GCC AAG AAG ATC CTC ATT GTC ATC 767 
His Lys Asn Gly Ala Arg Lye Ser Ala Lys Lys lie Leu lie Val lie 
240 245 250 255 

ACA GAT GGG CAG AAG TAG AAA GAC CCC CTG GAA TAC AGT GAT GTC ATC 815 
Thr ASP Gly Gin Lys Tyr Lys Asp Pro Leu Glu Tyr Ser Asp Val lie 
260 265 270 

CCC CAG GCA GAG AAG GCT GGC ATC ATC CGC TAC GCT ATC GGG GTG GGA 863 
Pro Gin Ala Glu Lys Ala Gly lie lie Arg Tyr Ala lie Gly Val Gly 
275 280 285 

CAC GCT TTC CAG GGA CCC ACT GCC AGG CAG GAG CTG AAT ACC ATC AGC 911 
His Ala Phe Gin Gly Pro Thr Ala Arg Gin Glu Leu Asn Thr He Ser 
290 295 3O0 

TCA GCG CCT CCG CAG GAC CAC GTG TTC AAG GTG GAC AAC TTT GCA GCC 959 
Ser Ala Pro Pro Gin Asp His Val Phe Lys Val Asp Asn Phe Ala Ala 
305 310 315 

CTT GGC AGC ATC CAG AAG CAG CTG CAG GAG AAG ATC TAT GCA GTT GAG 1007 
Leu Gly Ser He Gin Lys Gin Leu Gin Glu Lys lie Tyr Ala Val Glu 
320 325 330 335 

GGA ACC CAG TCC AGG GCA AGC AGC TCC TTC CAG CAC GAG ATG TCC CAA 1055 
Gly Thr Gin Ser Arg Ala Ser Ser Ser Phe Gin His Glu Met Ser Gin 
340 345 350 

GAA GGC TTC AGC ACA GCC CTC ACA ATG GAT GGC CTC TTC CTG GGG GCT 1103 
Glu Gly Phe Ser Thr Ala Leu Thr Met Asp Gly Leu Phe Leu Gly Ala 
355 360 365 

GTG GGG AGC TTT AGC TGG TCT GGA GGT GCC TTC CTG TAT CCC CCA AAT 1151 
Val Gly Ser Phe Ser Trp Ser Gly Gly Ala Phe Leu Tyr Pro Pro Asn 
370 375 380 

ATG AGC CCC ACC TTC ATC AAC ATG TCT CAG GAG AAT GTG GAC ATG AGG 1199 
Met Ser Pro Thr Phe He Asn Met Ser Gin Glu Asn Val Asp Met Arg 
385 390 395 

GAC TCT TAC CTG GGT TAC TCC ACC GAG CTA GCC CTG TGG AAG GGG GTA 1247 
Asp Ser Tyr Leu Gly Tyr Ser Thr Glu Leu Ala Leu Trp Lys Gly Val 
400 405 410 415 

CAG AAC CTG GTC CTG GGG GCC CCC CGC TAC CAG CAT ACC GGG AAG GCT 1295 
Gin Asn Leu Val Leu Gly Ala Pro Arg Tyr Gin His Thr Gly Lys Ala 
420 425 430 

GTC ATC TTC ACC CAG GTG TCC AGG CAA TGG AGG AAG AAG GCC GAA GTC 1343 
Val He Phe Thr Gin Val Ser Arg Gin Trp Arg Lys Lys Ala Glu Val 
435 440 445 

ACA GGG ACG CAG ATC GGC TCC TAC TTC GGG GCC TCC CTC TGC TCC GTG 1391 
Thr Gly Thr Gin He Gly Ser Tyr Phe Gly Ala Ser Leu Cys Ser Val 
450 455 460 
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GAT GTG GAG AGO GAT GGC AGC ACC GAC CTG ATC CTC ATT GGG GCC CCC 1439 
Asp Val Asp Ser Asp Gly Ser Thr Asp Leu lie Leu He Gly Ala Pro 
465 470 475 

CAT TAG TAT GAG GAG ACC CGA GGG GGC CAG GTG TCC GTG TGT CCC TTG 1487 
His Tyr Tyr Glu Gin Thr Arg Gly Gly Gin Val Ser Val Cys Pro Leu 
480 485 490 495 

CCT AGG GGG CAG AGG GTG CAG TGG CAG TGT GAC GCT GTT CTC CGT GGT 1535 
Pro Arg Gly Gin Arg Val Gin Trp Gin Cys Asp Ala Val Leu Arg Gly 
500 505 510 

GAG CAG GGC CAC CCC TGG GGC CGC TTT GGG GCA GCC CTG ACA GTG TTG 1583 
Glu Gin Gly His Pro Trp Gly Arg Phe Gly Ala Ala Leu Thr Val Leu 
515 520 525 

GGG GAT GTG AAT GAG GAC AAG CTG ATA GAC GTG GCC ATT GGG GCC CCG 1631 
Gly Asp Val Asn Glu Asp Lys Leu He Asp Val Ala lie Gly Ala Pro 
530 535 540 

GGA GAG CAG GAG AAC CGG GGT GCT GTC TAG CTG TTT CAC GGA GCC TCA 1679 
Gly Glu Gin Glu Asn Arg Gly Ala Val Tyr Leu Phe His Gly Ala Ser 
545 550 555 

GAA TCC GGC ATC AGC CCC TCC CAC AGC CAG CGG ATT GCC AGC TCC CAG 1727 
Glu Ser Gly He Ser Pro Ser His Ser Gin Arg He Ala Ser Ser Gin 
560 565 570 575 

CTC TCC CCC AGG CTG CAG TAT TTT GGG CAG GGG CTG AGT GGG GGT CAG 1775 
Leu Ser Pro Arg Leu Gin Tyr Phe Gly Gin Ala Leu Ser Gly Gly Gin 
580 585 590 

GAC CTC ACC CAG GAT GGA CTG ATG GAC CTG GCC GTG GGG GCC CGG GGC 1823 
Asp Leu Thr Gin Asp Gly Leu Met Asp Leu Ala Val Gly Ala Arg Gly 
595 600 605 

CAG GTG CTC CTG CTC AGG AGT CTG CCG GTG CTG AAA GTG GGG GTG GCC 1871 
Gin Val Leu Leu Leu Arg Ser Leu Pro Val Leu Lys Val Gly Val Ala 
610 615 620 

ATG AGA TTC AGC CCT GTG GAG GTG GCC AAG GCT GTG TAC CGG TGC TGG 1919 
Met Arg Phe Ser Pro Val Glu Val Ala Lys Ala Val Tyr Arg Cys Trp 
625 630 635 

GAA GAG AAG CCC AGT GCC CTG GAA GCT GGG GAC GCC ACC GTC TGT CTC 1967 
Glu Glu Lys Pro Ser Ala Leu Glu Ala Gly Asp Ala Thr Val Cys Leu 
640 645 650 655 

ACC ATC CAG AAA AGC TCA CTG GAC CAG CTA GGT GAC ATC CAA AGC TCT 2015 
Thr He Gin Lys Ser Ser Leu Asp Gin Leu Gly Asp He Gin Ser Ser 
660 665 670 

GTC AGG TTT GAT CTG GCA CTG GAC CCA GGT CGT CTG ACT TCT CGT GCC 2063 
Val Arg Phe Asp Leu Ala Leu Asp Pro Gly Arg Leu Thr Ser Arg Ala 
675 680 685 

ATT TTC AAT GAA ACC AAG AAC CCC ACT TTG ACT CGA AGA AAA ACC CTG 2111 
He Phe Asn Glu Thr Lys Asn Pro Thr Leu Thr Arg Arg Lys Thr Leu 
690 695 700 
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GGA CTG GGG ATT CAC TGT GAA ACC CTG AAG CTG CTT TTG CCA GAT TGT 2159 
Glv Leu Gly He His Cys Glu Thr Leu Lys Leu Leu Leu Pro Asp Cys 
" 705 710 715 



GTG GAG GAT GTG GTG AGC CCC ATC ATT CTG CAC CTC AAC TTC TCA CTG 2207 
Val Glu Asp Val Val Ser Pro He He Leu Hie Leu Asn Phe Ser Leu 
720 725 730 735 



GTG AGA GAG CCC ATC CCC TCC CCC CAG AAC CTG CGT CCT GTG CTG GCC 2255 
Val Arg Glu Pro He Pro Ser Pro Gin Asn Leu Arg Pro Val Leu Ala 
740 745 750 

GTG GGC TCA CAA GAC CTC TTC ACT GCT TCT CTC CCC TTC GAG AAG AAC 2303 
Val Glv Ser Cln Asp Leu Phe Thr Ala Ser Leu Pro Phe Glu Lys Asn 
^ 755 760 765 

TGT GGG CAA GAT CCC CTC TGT GAA GGG GAC CTG GGT GTC ACC CTC AGC 2351 
Cys Gly Cln Asp Cly Leu Cys Glu Gly Asp Leu Gly Val Thr Leu Ser 
770 775 780 

TTC TCA GCC CTC CAG ACC CTG ACC GTG GGG AGC TCC CTG GAG CTC AAC 2399 
Phe Ser Cly Leu Cln Thr Leu Thr Val Gly Ser Ser Leu Glu Leu Asn 
785 790 795 

GTG ATT GTG ACT CTC TCC AAC CCA GGT GAG GAT TCC TAC GGA ACC GTG 2447 
Val He Val Thr Val Trp Asn Ala Gly Glu Asp Ser Tyr Gly Thr Val 
800 805 810 815 

GTC AGC CTC TAC TAT CCA GCA GGG CTG TCC CAC CGA CGG GTG TCA GGA 2495 
Val Ser Leu Tyr Tyr Pro Ala Gly Leu Ser His Arg Arg Val Ser Gly 
820 825 830 

GCC CAG AAG CAC CCC CAT CAG AGT GCC CTG CGC CTG GCA TGT GAG ACA 2543 
Ala Gin Lys Gin Pro His Gin Ser Ala Leu Arg Leu Ala Cys Glu Thr 
835 840 845 

GTG CCC ACT GAG GAT GAG GGC CTA AGA AGC AGC CGC TGC AGT GTC AAC 2591 
Val Pro Thr Glu Asp Glu Gly Leu Arg Ser Ser Arg Cys Ser Val Asn 
850 855 860 

CAC CCC ATC TTC CAT GAG GGC TCT AAC GGC ACC TTC ATA GTC ACA TTC 2639 
His Pro He Phe His Glu Gly Ser Asn Gly Thr Phe He Val Thr Phe 
865 870 875 

GAT GTC TCC TAC AAG GCC ACC CTG GGA GAC AGG ATG CTT ATG AGG GCC 2687 
Asp Val Ser Tyr Lys Ala Thr Leu Gly Asp Arg Met Leu Met Arg Ala 
880 885 890 895 

AGT GCA AGC AGT GAG AAC AAT AAG GCT TCA AGC AGC AAG GCC ACC TTC 2735 
Ser Ala Ser Ser Glu Asn Asn Lys Ala Ser Ser Ser Lys Ala Thr Phe 
900 905 910 

CAG CTG GAG CTC CCG GTG AAG TAT GCA GTC TAC ACC ATG ATC AGC AGG 2783 
Gin Leu Glu Leu Pro Val Lys Tyr Ala Val Tyr Thr Met He Ser Arg 
915 920 925 

CAG GAA GAA TCC ACC AAG TAC TTC AAC TTT GCA ACC TCC GAT GAG AAG 2831 
Gin Glu Glu Ser Thr Lys Tyr Phe Asn Phe Ala Thr ser Asp Glu Lys 
930 935 940 
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AAA 
Lys 


ATG 
Met 
945 


AAA 
Lys 


GAG 
Glu 


GCT 
Ala 


GAG 
Glu 


CAT 
His 
950 


CGA 
Arg 


TAC 
Tyr 


CGT 
Arg 


GTG 
Val 


AAT 
Asn 
955 


AAC 
Asn 


CTC 
Leu 


AGC 
Ser 


CAG 
Gin 


2879 


CGA 
Arg 
960 


GAT 
Asp 


CTG 
Leu 


GCC 
Ala 


ATC 
He 


AGC 
Ser 
965 


ATT 
He 


AAC 
Asn 


TTC 
Phe 


TGG 
Trp 


GTT 
Val 
970 


CCT 
Pro 


GTC 
Val 


CTG 
Leu 


CTG 
Leu 


AAC 
Asn 
975 


2927 


GGG 
Gly 


GTG 
Val 


GCT 
Ala 


GTG 
Val 


TGG GAT GTG 
Trp Asp Val 
980 


GTC 
Val 


ATG 
Met 


GAG 
Glu 
985 


GCC 
Ala 


CCA 
Pro 


TCT 
ser 


CAG 
Gin 


AGT 
ser 
990 


CTC 
Leu 


2975 


CCC 
Pro 


TGT 
Cys 


GTT 
Val 


TCA 
Ser 
995 


GAG 
Glu 


AGA 
Arg 


AAA 
Lys 


CCT 
Pro 


CCC CAG 
Pro Gin 
1000 


CAT 
His 


TCT 
Ser 


GAC 
Asp 


TTC CTG 
Phe Leu 
1005 


ACC 
Thr 


3023 


CAG 
Gin 


ATT 
He 


TCA AGA 
Ser Arg 
1010 


AGT 
Ser 


CCC 
Pro 


ATG 
Met 


CTG GAC 
Leu Asp 
1015 


TGC 
Cys 


TCC 
Ser 


ATT 
He 


GCT GAC 
Ala Asp 
1020 


TGC 
Cys 


CTG 
Leu 


3071 


CAG 
Gin 


TTC CGC 
Phe Arg 
1025 


TGT 
Cys 


GAC 
Asp 


GTC 
Val 


CCC TCC 
Pro Ser 
1030 


TTC 
Phe 


AGC 
Ser 


GTC 
Val 


CAG GAG 
Gin Glu 
1035 


GAG 
Glu 


CTG 
Leu 


GAT 
Asp 


3119 


TTC ACC 
Phe Thr 
1040 


CTG 
Leu 


AAG 
Lys 


GGC AAT CTC 
Gly Asn Leu 
1045 


AGT 
Ser 


TTC 
Phe 


GGC 
Gly 


TGG GTC 
Trp Val 
1050 


CGC 
Arg 


GAG 
Glu 


ACA 
Thr 


TTG 
Leu 
1055 


3167 


CAG 
Gin 


AAG 
Lys 


AAG 
Lys 


GTG 
Val 


TTG GTC 
Leu Val 
1060 


GTG 
Val 


AGT 
Ser 


GTG 
Val 


GCT GAA 
Ala Glu 
1065 


ATT 
He 


ACG 
Thr 


TTC 
Phe 


GAC ACA 
Asp Thr 
1070 


3215 


TCC 
Ser 


GTG 
Val 


TAC 
Tyr 


TCC CAG 
Ser Gin 
1075 


CTT 
Leu 


CCA 
Pro 


GGA 
Gly 


CAG GAG 
Gin Glu 
1080 


GCA 
Ala 


TTT 
Phe 


ATG 
Met 


AGA GCT 
Arg Ala 
1085 


CAG 
Gin 


3263 


ATG 
Met 


GAG 
Glu 


ATG GTG 
Met Val 
1090 


CTA 
Leu 


GAA 
Glu 


GAA 
Glu 


GAC GAG 
Asp Glu 
1095 


GTC 
Val 


TAC 
Tyr 


AAT 
Asn 


GCC ATT 
Ala He 
1100 


CCC 
Pro 


ATC 
He 




ATC 
lie 


ATG GGC 
Met Gly 
1105 


AGC 
Ser 


TCT 
Ser 


GTG 
Val 


GGG GCT 
Gly Ala 
1110 


CTG 
Leu 


CTA 
Leu 


CTG 
Leu 


CTG GCG 
Leu Ala 
1115 


CTC 
Leu 


ATC 
He 


ACA 
Thr 


3359 


GCC ACA 
Ala Thr 
1120 


CTG 
Leu 


TAC 
Tyr 


AAG 
Lys 


CTT GGC 
Leu Gly 
1125 


TTC 
Phe 


TTC 
Phe 


AAA 
Lys 


CGC 
Arg 
113C 


CAC 
His 


TAC 
Tyr 


AAG 
Lys 


GAA 
Glu 


ATG 
Met 
1135 


3407 


CTG 
Leu 


GAG 
Glu 


GAC 
Asp 


AAG 
Lys 


CCT GAA 
Pro Glu 
1140 


GAC 
Asp 


ACT 
Thr 


GCC 
Ala 


ACA 
Thr 
1145 


TTC 
Phe 


AGT 
Ser 


GGG 
Gly 


GAC 
Asp 


GAT 
Asp 
1150 


TTC 
Phe 


3455 


AGC 
Ser 


TGT 
Cys 


GTG 
Val 


GCC 
Ala 


CCA 
Pro 


AAT 
Asn 


GTG 
Val 


CCT 
Pro 


TTG 
Leu 


TCC 
Ser 


TAATAATCCA CTTTCCTGTT 


3505 



1155 1160 

TATCTCTACC ACTGTGGGCT GGACTTGCTT GCAACCATAA ATCAACTTAC ATGGAAACAA 3566 

CTTCTGCATA GATCTGCACT GGCCTAAGCA ACCTACCAGG TGCTAAGCAC CTTCTCGGAG 3625 

AGATAGAGAT TGTAATGTTT TTACATATCT GTCCATCTTT TTCAGCAATG ACCCACTTTT 3685 
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TACAGAAGCA GGCATGGTGC CAGCATAAAT TTTCATATGC T 3726 
(2) INFORMATION FOR SEQ ID NO; 2; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1161 eunino acids 

(B) TYPE; amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Thr Phe Glv Thr Val Leu Leu Leu Ser Val Leu Ala Ser Tyr His Gly 
15 10 15 

Phe Asn Leu Asp Val Glu Glu Pro Thr lie Phe Gin Glu Asp Ala Gly 
20 25 30 

Gly Phe Gly Gin Ser Val Val Gin Phe Gly Gly Ser Arg Leu Val Val 
35 40 45 

Gly Ala Pro Leu Glu Val Val Ala Ala Asn Gin Thr Gly Arg Leu Tyr 
50 55 60 

Asp Cys Ala Ala Ala Thr Gly Met Cys Gin Pro lie Pro Leu His He 
65 70 75 80 

Arg Pro Glu Ala Val Asn Met Ser Leu Gly Leu Thr Leu Ala Ala Ser 
85 90 95 

Thr Asn Gly Ser Arg Leu Leu Ala Cys Gly Pro Thr Leu His Arg Val 
100 105 110 

Cys Gly Glu Asn Ser Tyr Ser Lys Gly Ser Cys Leu Leu Leu Gly Ser 
115 120 125 

Arg Trp Glu He He Gin Thr Val Pro Asp Ala Thr Pro Glu Cys Pro 
130 135 140 

His Gin Glu Met Asp He Val Phe Leu He Asp Gly Ser Gly Ser He 
145 150 155 160 

Asp Gin Asn Asp Phe Asn Gin Met Lys Gly Phe Val Gin Ala Val Met 
165 170 175 

Gly Gin Phe Glu Gly Thr Asp Thr Leu Phe Ala Leu Met Gin Tyr Ser 
180 185 190 

Asn Leu Leu Lys He His Phe Thr Phe Thr Gin Phe Arg Thr Ser Pro 
195 200 205 

Ser Gin Gin ser Leu Val Asp Pro He val Gin Leu Lys Gly Leu Thr 
210 215 220 

Phe Thr Ala Thr Gly He Leu Thr Val Val Thr Gin Leu Phe His His 
225 230 235 240 

Lys Asn Gly Ala Arg Lys Ser Ala Lys Lys He Leu He Val He Thr 
245 250 255 
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Asp Gly Gin Lys Tyr Lys Asp Pro Leu Glu Tyr Ser Asp Val lie Pro 
260 265 270 

Gin Ala Glu Lys Ala Gly lie lie Arg Tyr Ala He Gly Val Gly His 
275 280 285 

Ala Phe Gin Gly Pro Thr Ala Krg Gin Glu Leu Asn Thr He Ser Ser 
290 295 300 

Ala Pro Pro Gin Asp His Val Phe Lys Val Asp Asn Phe Ala Ala Leu 
305 310 315 320 

Gly Ser He Gin Lys Gin Leu Gin Glu Lys He Tyr Ala Val Glu Gly 
325 330 335 

Thr Gin Ser Arg Ala Ser Ser Ser Phe Gin His Glu Met Ser Gin Glu 
340 345 350 

Gly Phe Ser Thr Ala Leu Thr Met Asp Gly Leu Phe Leu Gly Ala Val 
355 360 365 

Gly Ser Pho Ser Trp Ser Gly Gly Ala Phe Leu Tyr Pro Pro Asn Met 
370 375 380 

Ser Pro Thr Phe He Asn Met Ser Gin Glu Asn Val Asp Met Arg Asp 
385 390 395 400 

Ser Tyr Leu Gly Tyr Ser Thr Glu Leu Ala Leu Trp Lys Gly Val Gin 
405 410 415 

Asn Leu Val Leu Gly Ala Pro Arg Tyr Gin His Thr Gly Lys Ala Val 
420 425 430 

He Phe Thr Gin Val Ser Arg Gin Trp Arg Lys Lys Ala Glu Val Thr 
435 440 445 

Gly Thr Gin He Gly Ser Tyr Phe Gly Ala Ser Leu Cys Ser Val Asp 
450 455 460 

Val Asp Ser Asp Gly Ser Thr Asp Leu He Leu He Gly Ala Pro His 
465 470 475 480 

Tyr Tyr Glu Gin Thr Arg Gly Gly Gin Val Ser Val Cys Pro Leu Pro 
485 490 495 

Arg Gly Gin Arg Val Gin Trp Gin Cys Asp Ala Val Leu Arg Gly Glu 
500 505 510 

Gin Gly His Pro Trp Gly Arg Phe Gly Ala Ala Leu Thr Val Leu Gly 
515 520 525 

Asp Val Asn Glu Asp Lys Leu He Asp Val Ala He Gly Ala Pro Gly 
530 535 540 

Glu Gin Glu Asn Arg Gly Ala Val Tyr Leu Phe His Gly Ala Ser Glu 
545 550 555 560 

Ser Gly He Ser Pro Ser His Ser Gin Arg He Ala Ser Ser Gin Leu 
565 570 575 
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Ser Pro Arg Leu Gin Tyr Phe Giy Gin Ala Leu Ser Gly Gly Gin Asp 
580 585 590 

Leu Thr Gin Asp Gly Leu Met Asp Leu Ala Val Gly Ala Arg Gly Gin 
595 600 605 

Val Leu Leu Leu Arg Ser Leu Pro Val Leu Lys Val Gly Val Ala Met 
610 615 620 

Arg Phe ser Pro Val Glu Val Ala Lys Ala Val Tyr Arg Cys Trp Glu 
625 630 635 640 

Glu Lys Pro Ser Ala Leu Glu Ala Gly Asp Ala Thr Val Cys Leu Thr 
645 650 655 

He Gin Lys Ser Ser Leu Asp Gin Leu Gly Asp He Gin Ser Ser Val 
660 665 670 

Arg Phe Asp Leu Ala Leu Asp Pro Gly Arg Leu Thr Ser Arg Ala He 
675 680 685 

Phe Asn Glu Thr Lys Asn Pro Thr Leu Thr Arg Arg Lys Thr Leu Gly 
690 695 700 

Leu Gly He His Cys Glu Thr Leu Lys Leu Leu Leu Pro Asp Cys Val 
705 710 715 720 

Glu Asp Val Val Ser Pro He He Leu His Leu Asn Phe Ser Leu Val 
725 730 735 

Arg Glu Pro He Pro Ser Pro Gin Asn Leu Arg Fro Val Leu Ala Val 
740 745 750 

Gly Ser Gin Asp Leu Phe Thr Ala Ser Leu Pro Phe Glu Lys Asn Cys 
755 760 765 

Gly Gin Asp Gly Leu Cys Glu Gly Asp Leu Gly Val Thr Leu Ser Phe 
770 775 780 

Ser Gly Leu Gin Thr Leu Thr Val Gly Ser Ser Leu Glu Leu Asn Val 
785 790 795 800 

He Val Thr Val Trp Asn Ala Gly Glu Asp Ser Tyr Gly Thr Val Val 
805 810 815 

Ser Leu Tyr Tyr Pro Ala Gly Leu Ser His Arg Arg Val Ser Gly Ala 
820 825 830 

Gin Lys Gin Pro His Gin Ser Ala Leu Arg Leu Ala Cys Glu Thr Val 
835 840 845 

Pro Thr Glu Asp Glu Gly Leu Arg Ser Ser Arg Cys Ser Val Asn His 
850 855 860 

Pro He Phe His Glu Gly Ser Asn Gly Thr Phe He Val Thr Phe Asp 
865 870 875 880 

Val Ser Tyr Lys Ala Thr Leu Gly Asp Arg Met Leu Met Arg Ala Ser 
885 890 895 
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Ala Ser Ser Glu Asn Asn Lys Ala Ser Ser Ser Lys Ala Thr Phe Gin 
900 905 910 

Leu Glu Leu Pro Val Lys Tyr Ala Val Tyr Thr Met lie Ser Arg Gin 
915 920 925 

Glu Glu Ser Thr Lys Tyr Phe Asn Phe Ala Thr Ser Asp Glu Lys Lys 
930 935 940 

Met Lye Glu Ala Glu His Arg Tyr Arg Val Asn Asn Leu Ser Gin Arg 
945 950 955 960 

Asp Leu Ala lie Ser lie Asn Phe Trp Val Pro Val Leu Leu Asn Gly 
965 970 975 

Val Ala Val Trp Asp Val Val Met Glu Ala Pro Ser Gin Ser Leu Pro 
980 985 990 

Cys Val Ser Glu Arg Lys Pro Pro Gin His Ser Asp Phe Leu Thr Gin 
995 1000 1005 

lie Ser Arg Ser Pro Met Leu Asp Cys Ser lie Ala Asp Cys Leu Gin 
1010 1015 1020 

Phe Arg Cys Asp Val Pro Ser Phe Ser Val Gin Glu Glu Leu Asp Phe 
1025 1030 1035 1Q40 

Thr Leu Lys Gly Asn Leu Ser Phe Gly Trp Val Arg Glu Thr Leu Gin 
1045 1050 1055 

Lys Lys Val Leu Val Val Ser Val Ala Glu lie Thr Phe Asp Thr Ser 
1060 1065 1070 

Val Tyr Ser Gin Leu Pro Gly Gin Glu Ala Phe Met Arg Ala Gin Met 
1075 1080 1085 

Glu Met Val Leu Glu Glu Asp Glu Val Tyr Asn Ala lie Pro lie lie 
1090 1095 1100 

Met Gly Ser Ser Val Gly Ala Leu Leu Leu Leu Ala Leu lie Thr Ala 
1105 1110 1115 1120 

Thr Leu Tyr Lys Leu Gly Phe Phe Lys Arg His Tyr Lys Glu Met Leu 
1125 1130 1135 

Glu Asp Lys Pro Glu Asp Thr Ala Thr Phe Ser Gly Asp Asp Phe Ser 
1140 1145 1150 

Cys Val Ala Pro Asn Val Pro Lys Ser 
1155 1160 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1153 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION 2 SEQ ID NO: 3: 

Met Ala Leu Arg Val Leu Leu Leu Thr Ala Leu Thr Leu Cys His Gly 
15 10 15 

Phe Asn Leu Asp Thr Glu Asn Ala Met Thr Phe Gin Glu Asn Ala Arg 
20 25 30 

Gly Phe Gly Gin Ser Val Val Gin Leu Gin Gly Ser Arg Val Val Val 
35 40 45 

Gly Ala Pro Gin Glu lie Val Ala Ala Asn Gin Arg Gly Ser Leu Tyr 
50 55 60 

Gin Cys Asp Tyr Ser Thr Gly Ser Cys Glu Pro lie Arg Leu Gin Val 
65 70 75 80 

Pro Val Glu Ala Val Asn Met Ser Leu Gly Leu Ser Leu Ala Ala Thr 
85 90 95 

Thr Ser Pro Pro Gin Leu Leu Ala Cys Gly Pro Thr Val His Gin Thr 
100 105 110 

Cys Ser Glu Asn Thr Tyr Val Lys Gly Leu Cys Phe Leu Phe Gly Ser 
115 120 125 

Asn Leu Arg Gin Gin Pro Gin Lys Phe Pro Glu Ala Leu Arg Gly Cys 
130 135 140 

Pro Gin Glu Asp Ser Asp lie Ala Phe Leu lie Asp Gly Ser Gly Ser 
145 ISO 155 160 

lie lie Pro His Asp Phe Arg Arg Met Lys Glu Phe Val Ser Thr Val 
165 170 175 

Met Glu Gin Leu Lys Lys Ser Lys Thr Leu Phe Ser Leu Met Gin Tyr 
180 185 190 

Ser Glu Glu Phe Arg lie His Phe Thr Phe Lys Glu Phe Gin Asn Asn 
195 200 205 

Pro Asn Pro Arg Ser Leu Val Lys Pro He Thr Gin Leu Leu Gly Arg 
210 215 220 

Thr His Thr Ala Thr Gly He Arg Lys Val Val Arg Glu Leu Phe Asn 
225 230 235 240 

He Thr Asn Gly Ala Arg Lys Asn Ala Phe Lys He Leu Val Val He 
245 250 255 

Thr Asp Gly Glu Lys Phe Gly Asp Pro Leu Gly Tyr Glu Asp Val He 
260 265 270 

Pro Glu Ala Asp Arg Glu Gly Val He Arg Tyr Val He Gly Val Gly 
275 280 285 

Asp Ala Phe Arg Ser Glu Lys Ser Arg Gin Glu Leu Asn Thr He Ala 
290 295 300 



wo 95/17412 



PCT/US94/14832 



-92- 

Ser Lya Pro Pro Arg Asp His Val Phe Gin Val Asn Asn Phe Glu Ala 
305 310 315 320 

Leu Lys Thr lie Gin Asn Gin Leu Arg Glu Lys lie Phe Ala Xle Glu 
325 330 335 

Gly Thr Gin Thr Gly Ser Ser Ser Ser Phe Glu His Glu Met Ser Gin 
340 345 350 

Glu Gly Phe Ser Ala' Ala lie Thr Ser Asn Gly Pro Leu Leu Ser Thr 
355 360 365 

Val Gly Ser Tyr Asp Trp Ala Gly Gly Val Phe Leu Tyr Thr Ser Lys 
370 375 380 

Glu Lys Ser Thr Phe lie Asn Met Thr Arg Val Asp Ser Asp Met: Asn 
385 390 395 400 

Asp Ala Tyr Leu Gly Tyr Ala Ala Ala lie lie Leu Arg Asn Arg Val 
405 410 415 

Gin Ser Leu Val Leu Gly Ala Pro Arg Tyr Gin His lie Gly Leu Val 
420 425 430 

Ala Met Phe Arg Gin Asn Thr Gly Met Trp Glu Ser Asn Ala Asn Val 
435 440 445 

Lys Gly Thr Gin lie Gly Ala Tyr Phe Gly Ala Ser Leu Cys Ser Val 
450 455 460 

Asp Val Asp Ser Asn Gly Ser Thr Asp Leu Val Leu lie Gly Ala Pro 
465 470 475 480 

His Tyr Tyr Glu Gin Thr Arg Gly Gly Gin Val Ser Val Cys Pro Leu 
485 490 495 

Pro Arg Gly Gin Arg Ala Arg Trp Gin Cys Asp Ala Val Leu Tyr Gly 
500 505 510 

Glu Gin Gly Gin Pro Trp Gly Arg Phe Gly Ala Ala Leu Thr Val Leu 
515 520 525 

Gly Asp Val Asn Gly Asp Lys Leu Thr Asp Val Ala lie Gly Ala Pro 
530 535 540 

Gly Glu Glu Asp Asn Arg Gly Ala Val Tyr Leu Phe His Gly Thr Ser 
545 550 555 560 

Gly Ser Gly lie Ser Pro Ser His Ser Gin Arg lie Ala Gly Ser Lys 
565 570 575 

Leu Ser Pro Arg Leu Gin Tyr Phe Gly Gin Ser Leu Ser Gly Gly Gin 
580 585 590 

Asp Leu Thr Met Asp Gly Leu Val Asp Leu Thr Val Gly Ala Gin Gly 
595 600 605 

His Val Leu Leu Leu Arg Ser Gin Pro Val Leu Arg Val Lys Ala lie 
610 615 620 
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Met Glu Phe Asn Pro Arg Glu Val Ala Arg Aan Val Phe Glu Cys Asn 
625 630 635 640 

Asp Gin Val Val Lys Gly Lys Glu Ala Gly Glu Val Arg Val Cys Leu 
^ 645 650 655 

His Val Gin Lys Ser Thr Arg Asp Arg Leu Arg Glu Gly Gin lie Gin 
660 665 670 

Ser Val Val Thr Tyr Asp Leu Ala Leu Asp Ser Gly Arg Pro His. Ser 
675 680 685 

Arg Ala Val Phe Asn Glu Thr Lys Asn Ser Thr Arg Arg Gin Thr Gin 
690 695 700 

Val Leu Gly Leu Thr Gin Thr Cys Glu Thr Leu Lys Leu Gin Leu Pro 
705 710 715 720 

Asn Cys He Glu Asp Pro Val Ser Pro He Val Leu Arg Leu Asn Phe 
725 730 735 

Ser Leu Val Gly Thr Pro Leu Ser Ala Phe Gly Asn Leu Arg Pro Val 
740 745 750 

Leu Ala Glu Asp Ala Gin Arg Leu Phe Thr Ala Leu Phe Pro Phe Glu 
755 760 765 

Lys Asn Cys Gly Asn Asp Asn He Cys Gin Asp Asp Leu Ser He Thr 
770 775 780 

Phe Ser Phe Met Ser Leu Asp Cys Leu Val Val Gly Gly Pro Arg Glu 
785 790 795 BOO 

Phe Asn Val Thr Val Thr Val Arg Asn Asp Gly Glu Asp Ser Tyr Arg 
805 810 815 

Thr Gin Val Thr Phe Phe Phe Pro Leu Asp Leu Ser Tyr Arg Lys Val 
820 825 830 

Ser Thr Leu Gin Asn Gin Arg Ser Gin Arg Ser Trp Arg Leu Ala Cys 
835 840 845 

Glu Ser Ala Ser Ser Thr Glu Val Ser Gly Ala Leu Lys Ser Thr Ser 
850 855 860 

Cys Ser He Asn His Pro He Phe Pro Glu Asn Ser Glu Val Thr Phe 
865 870 875 880 

Asn He Thr Phe Asp Val Asp Ser Lys Ala Ser Leu Gly Asn Lys Leu 
885 890 895 

Leu Leu Lys Ala Asn Val Thr Ser Glu Asn Asn Met Pro Arg Thr Asn 
900 905 910 

Lys Thr Glu Phe Gin Leu Glu Leu Pro Val Lys Tyr Ala Val Tyr Met 
915 920 925 

Val Val Thr Ser His Gly Val Ser Thr Lys Tyr Leu Asn Phe Thr Ala 
930 935 940 
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Ser Glu Asn Xhr Ser Arg Val Met Gin His Gin Tyr Gin Val Ser Asn 
945 950 955 960 

Leu Gly Gin Arg Ser Leu Pro He Ser Leu Val Phe Leu Val Pro Val 
965 970 975 

Arg Leu Asn Gin Xhr Val He Trp Asp Arg Pro Gin Val Thr Phe Ser 
980 985 990 

Glu Asn Leu Ser Ser Thr Cys His Thr Lys Glu Arg Leu Pro Ser His 
995 1000 1005 

Ser Asp Phe Leu Ala Glu Leu Arg Lys Ala Pro Val Val Asn Cys Ser 
1010 1015 1020 

He Ala Val Cys Gin Arg He Gin Cys Asp He Pro Phe Phe Gly He 
1025 103O 1035 1040 

Gin Glu Glu Phe Asn Ala Thr Leu Lys Gly Asn Leu Ser Phe Asp Trp 
1045 1050 1055 

Tyr He Lys Thr Ser His Asn His Leu Leu He Val Ser Thr Ala Glu 
1060 1065 1070 

He Leu Phe Asn Asp Ser Val Phe Thr Leu Leu Pro Gly Gin Gly Ala 
1075 1080 1085 

Phe Val Arg Ser Gin Thr Glu Thr Lys Val Glu Pro Phe Glu Val Pro 
1090 1095 1100 

Asn Pro Leu Pro Leu He Val Gly Ser Ser Val Gly Gly Leu Leu Leu 
1105 1110 1115 1120 

Leu Ala Leu He Thr Ala Ala Leu Tyr Lys Leu Gly Phe Phe Lys Arg 
1125 1130 1135 

Gin Tyr Lys Asp Met Met Ser Glu Gly Gly Pro Pro Gly Ala Glu Pro 
1140 1145 1150 

Gin 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1163 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

Met Thr Arg Thr Arg Ala Ala Leu Leu Leu Phe Thr Ala Leu Ala Thr 
is 10 15 

Ser Leu Gly Phe Asn Leu Asp Thr Glu Glu Leu Thr Ala Phe Arg Val 
20 25 30 
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Asp Ser Ala Gly Phe Gly Asp Ser Val Val Gin Tyr Ala Asn Ser Trp 
35 40 45 

Val Val Val Gly Ala Pro Gin Lys lie lie Ala Ala Asn Gin lie Gly 
50 55 60 

Glv Leu Tyr Gin Cys Gly Tyr Ser Thr Gly Ala Cys Glu Pro lie Gly 
65 70 75 80 

Leu Gin Val Pro Pro Glu Ala Val Asn Met Ser Leu Gly Leu Ser Leu 
85 90 95 

Ala Ser Thr Thr Ser Pro Ser Gin Leu Leu Ala Cys Gly Pro Thr Val 
100 105 110 

His Hie Glu Cys Gly Arg Asn Met Tyr Leu Thr Gly Leu Cys Phe Leu 
115 120 125 

Leu Cly Pro Thr Cln Leu Thr Gin Arg Leu Pro Val Ser Arg Gin Glu 
130 135 140 

Cys Pro Arg Cln Glu Gin Asp He Val Phe Leu He Asp Gly Ser Gly 
145 150 155 160 

Ser He Ser Ser Arg Asn Phe Ala Thr Met Met Asn Phe Val Arg Ala 
165 170 175 

Val II© Smr Cln Phe Gin Arg Pro Ser Thr Gin Phe Ser Leu Met Gin 
ISO 185 190 

Phe Ser Asn Lys Phe Gin Thr His Phe Thr Phe Glu Glu Phe Arg Arg 
195 200 205 

Thr Ser Asn Pro Leu Ser Leu Leu Ala Ser Val His Gin Leu Gin Gly 
210 215 220 

Phe Thr Tyr Thr Ala Thr Ala He Gin Asn Val Val His Arg Leu Phe 
225 230 235 240 

His Ala Ser Tyr Gly Ala Arg Arg Asp Ala He Lys He Leu He Val 
245 250 255 

He Thr Asp Gly Lys Lys Glu Gly Asp Ser Leu Asp Tyr Lys Asp Val 
260 265 270 

He Pro Met Ala Asp Ala Ala Gly He He Arg Tyr Ala He Gly Val 
275 280 285 

Gly Leu Ala Phe Gin Asn Arg Asn Ser Trp Lys Glu Leu Asn Asp He 
290 295 300 

Ala Ser Lys Pro Ser Gin Glu His He Phe Lys Val Glu Asp Phe Asp 
305 310 315 320 

Ala Leu Lys Asp He Gin Asn Gin Leu Lys Glu Lys He Phe Ala He 
325 330 335 

Glu Gly Thr Glu Thr He Ser Ser Ser Ser Phe Glu Leu Glu Met Ala 
340 345 350 
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Gin Glu Gly Phe Ser Ala Val Phe Thr Pro Asp Gly Pro Val Leu Gly 
355 360 365 

Ala Val Gly Ser Phe Thr Trp Ser Gly Gly Ala Phe Leu Tyr Pro Pro 
370 375 380 

Asn Met Ser Pro Thr Phe lie Asn Met Ser Gin Glu Asn Val Asp Met 
385 390 395 400 

Arg Asp Ser Tyr Leu Gly Tyr Ser Thr Glu Leu Ala Leu Trp Lys Gly 
405 410 415 

Val Gin Ser Leu Val Leu Gly Ala Pro Arg Tyr Gin His lie Gly Lys 
420 425 430 

Ala Val lie Phe He Gin Val Ser Arg Gin Trp Arg Met Lys Ala Glu 
435 440 445 

Val lie Gly Thr Gin He Gly Ser Tyr Phe Gly Ala Ser Leu Cys Ser 
450 455 460 

Val Asp Val Asp Thr Asp Gly Ser Thr Asp Leu Val Leu He Gly Ala 
465 470 475 480 

Pro His Tyr Tyr Glu Gin Thr Arg Gly Gly Gin Val Ser Val Cys Pro 
485 490 495 

Leu Pro Arg Gly Trp Arg Arg Trp Trp Cys Asp Ala Val Leu Tyr Gly 
500 505 510 

Glu Gin Gly His Pro Trp Gly Arg Phe Gly Ala Ala Leu Thr Val Leu 
515 520 525 

Gly Asp Val Asn Gly Asp Lys Leu Thr Asp Val Val He Gly Ala Pro 
530 535 540 

Gly Glu Glu Glu Asn Arg Gly Ala Val Tyr Leu Phe His Gly Val Leu 
545 550 555 560 

Gly Pro Ser He Ser Pro Ser His Ser Gin Arg He Ala Gly Ser Gin 
565 570 575 

Leu Ser Ser Arg Leu Gin Tyr Phe Gly Gin Ala Leu Ser Gly Gly Gin 
580 585 590 

Asp Leu Thr Gin Asp Gly Leu Val Asp Leu Ala Val Gly Ala Aro Glv 
595 600 605 

Gin Val Leu Leu Leu Arg Thr Arg Pro Val Leu Trp Val Gly Val Ser 
610 615 620 

Met Gin Phe He Pro Ala Glu He Pro Arg Ser Ala Phe Glu Cys Ara 
625 630 635 640 

Glu Gin Val Val Ser Glu Gin Thr Leu Val Gin Ser Asn He Cys Leu 
645 650 655 

Tyr He Asp Lys Arg Ser Lys Asn Leu Leu Gly Ser Arg Asp Leu Gin 
660 665 670 
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Ser Ser Val Thr Leu Asp Leu Ala Leu Ala Pro Gly Arg Leu Ser Pro 
675 680 685 

Arg Ala lie Phe Gin Glu Thr Lye Asn Arg Ser Leu Ser Arg Val Arg 
690 695 700 

Val Leu Gly Leu Lys Ala His Cys Glu Asn Phe Asn Leu Leu Leu Pro 
705 710 715 720 

Ser Cys Val Glu Asp Ser Val He Pro He He Leu Arg Leu Asn Phe 
725 730 735 

Thr Leu Val Gly Lys Pro Leu Leu Ala Phe Arg Asn Leu Arg Pro Met 
740 745 750 

Leu Ala Ala Leu Ala Gin Arg Tyr Phe Thr Ala Ser Leu Pro Phe Glu 
755 760 765 

Lys Asn Cys Gly Ala Asp His He Cys Gin Asp Asn Leu Gly He Ser 
770 775 780 

Phe Ser Phe Pro Gly Leu Lys ser Leu Leu Val Gly Ser Asn Leu Glu 
785 790 795 800 

Leu Asn Ala Glu Val Met Val Trp Asn Asp Gly Glu Asp Ser Tyr Gly 
805 810 815 

Thr Thr He Thr Phe Ser His Pro Ala Gly Leu Ser Tyr Arg Tyr Val 
820 825 830 

Ala Glu Gly Gin Lys Gin Gly Gin Leu Arg Ser Leu His Leu Thr Cys 
835 840 845 

Cys Ser Ala Pro Val Gly Ser Gin Gly Thr Trp Ser Thr Ser Cys Arg 
850 855 860 

He Asn His Leu He Phe Arg Gly Gly Ala Gin He Thr Phe Leu Ala 
865 870 875 880 

Thr Phe Asp Val Ser Pro Lys Ala Val Gly Leu Asp Arg Leu Leu Leu 
885 890 895 

He Ala Asn Val Ser Ser Glu Asn Asn He Pro Arg Thr Ser Lys Thr 
900 905 910 

He Phe Gin Leu Glu Leu Pro Val Lys Tyr Ala Val Tyr He Val Val 
915 920 925 

Ser Ser His Glu Gin Phe Thr Lys Tyr Leu Asn Phe Ser Glu Ser Glu 
930 935 940 

Glu Lys Glu Ser His Val Ala Met His Arg Tyr Gin Val Asn Asn Leu 
945 950 955 960 

Gly Gin Arg Asp Leu Pro Val Ser He Asn Phe Trp Val Pro Val Glu 
965 970 975 

Leu Asn Gin Glu Ala Val Trp Met Asp Val Glu Val Ser His Pro Gin 
980 985 990 
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Asn Pro Ser Leu Arg Cys Ser Ser Glu Lys lie Ala Pro Pro Ala Ser 
995 1000 1005 

Asp Phe Leu Ala His lie Gin Lys Asn Pro Val Leu Asp Cys Ser He 
1010 1015 1020 

Ala Gly Cys Leu Arg Phe Arg Cys Asp Val Pro Ser Phe Ser Val Gin 
1025 1030 1035 1040 

Glu Glu Leu Asp Phe Thr Leu Lys Gly Asn Leu Ser Phe Gly Trp Val 
1045 1050 1055 

Arg Gin He Leu Gin Lys Lys Val Ser Val Val Ser Val Ala Glu He 
1060 1065 1070 

He Phe Asp Thr Ser Val Tyr Ser Gin Leu Pro Gly Gin Glu Ala Phe 
1075 1080 1085 

Met Arg Ala Cln Thr He Thr Val Leu Glu Lys Tyr Lys Val His Asn 
1090 1095 1100 

Pro He Pro Leu He Val Gly Ser Ser He Gly Gly Leu Leu Leu Leu 
1105 1110 1115 1120 

Ala Leu He Thr Ala Val Leu Tyr Lys Val Gly Phe Phe Lys Arg Gin 
1125 1130 1135 

Tyr Lys Glu Met Met Glu Glu Ala Asn Gly Gin He Ala Pro Glu Asn 
1140 1145 1150 

Gly Thr Gin Thr Pro Ser Pro Pro Ser Glu Lys 
1155 1160 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGy: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Phe Asn Leu Asp Val Glu Glu Pro Met Val Phe Gin 
15 10 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
TTYAAYYTGG AYGTNGARGA RCCNATGGTN TTYCA 
<2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
TTCAACCTGG ACGTGGAGGA GCCCATGGTG TTCCAA 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 36 base pairs 
<B) TYPE: nucleic acid 

( C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
TTCAACCTGG ACGTNGAASA NCCCATGGTC TTCCAA 36 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
TTYAAYYTNG AYGTNGARGA RCC 23 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 



BNSDCXID: <WO 9517412A1_I_> 



wo 95/17412 



PCT/US94/14832 



- 100 - 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
TTYAAyyTGG ACGTNGAAGA 20 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
{ D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
TGRAANACCA TNGGYTC 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
TTGGAAGACC ATNGGYTC 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
ATTAACCCTC ACTAAAG 
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(2) INFORMATION FOR SEQ ID NO; 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) liENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
AATACGACTC ACTATAG 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : I inear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: 

Val Phe Gin Glu Xaa Gly Ala Gly Phe Gly Gin 
1 5 ~ 10 

(2) INFORMATION FOR SEQ ID NO:16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Leu Tyr Asp Xaa Val Ala Ala Thr Gly Leu Xaa Gin Pro lie 
15 10 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 17: 

Pro Leu Glu Tyr Xaa Asp Val lie Pro Gin Ala Glu 
15 10 

(2) INFORMATION FOR SEQ ID NO:18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Phe Gin Glu Gly Phe Ser Xaa Val Leu Xaa 
15 10 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 
<B) TYPE: amino acid 
< D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Thr Ser Pro Thr Phe lie Xaa Met Ser Gin Glu Asn Val Asp 
15 10 

(2) INFORMATION FOR SEQ ID NO:20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Leu Val Val Gly Ala Pro Leu Glu Val Val Ala Val Xaa Gin Thr Glv 
i 5 10 15 

Arg 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: cunino acid 
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(D) TOPOLOGY: linear 



(ii) 



MOLECULE TYPE: peptide 



(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO: 21: 



Leu 
1 



Asp Xaa Lys Pro Xaa Asp Thr Ala 
5 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS ; s ingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

Phe Gly Glu Gin Phe Ser Glu 
1 5 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : 6 ingle 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
RAANCCYTCY TGRAAACTYT C 21 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1006 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
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TTCAACCTGG ACGTGGAGGA GCCCATGGTG TTCAAGAGGA TGGAGCTGGC TTTGGACAGA 60 

GCGTGGCCCA GCTTGGCGGA TCTAGACTCG TGGTGGGAGC CCCCCTGGAG GTGGTGGCGG 120 

TCAACCAAAC AGGAAGGTTG TATGACTGTG TGGCTGCCAC TGGCCTTGTC AACCCATACC 180 

CCTGCACACA CCCCCAGATG CTGTGAACAT GTCCCTGGGT CTGTCCCTGT CAGCCGCCGC 240 

CAGTCGCCCC TGGCTGCTGG CCTGTGGCCC AACCATGCAC AGAGCCTGTG GGGAGAATAT 300 

GTATGCAGAA GGCTTTTGCC TCCTGTTGGA CTCCCATCTG CAGACCATTT GGACAGTACC 360 

TGCTGCCCTA CCAGAGTGTC CAAGTCAAGA GATGGACATT GTCTTCCTGA TTGATGGTTC 420 

TGGCAGTATG AGCAAAGTGA CTTTAAACAA ATGAAGGATT TGTGAGAGCT GTGATGGGAC 480 

AGTTTGAGGG CACCCAAACC CTGTTCTCAC TGATACAGTA TCCCACCTCC CTGAAGATCC 540 

ACTTCACCTT CACGCAATTC CAGAGCAGCT GGAACCCTCT GAGCCTGGTG GATCCCATTG 600 

TCCAACTGGA CGGCCTGACA TATACAGCCA CGGGCATCCG GAAAGTGGTG GAGGAACTGT 660 

TTCATAGTAA GAATGGGGCC CGTAAAAGTG CCAAGAAGAT CCTCATTGTC ATCACAGATG 720 

GCAAAAATAC AAAGACCCCC TGGAGTACGA GGACGTATCC CCAGGCAGAG AGAGCGGATC 780 

ATCCGCTATG CCATTGGGGT GGGAGATGCT TTCTGGAAAC CCAGTGCCAA GCAGGAGCTG 840 

GACAACATTG GCTCAGAGCC GGCTCAGGAC CATGTGTTCA GGGTGGACAA CTTTGCAGCA 900 

CTCAGCAGCA TCCAGGAGCA GCTGCAGGAG AAGATCTTTG CACTCGAAGG AACCCAGTC6 960 

ACGACAAGTA GCTCTTTCCA ACATGAGATG TTCCAAGAAG GGTTCA 1006 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 
GTNTTYCARG ARGAYGG 17 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
CCACTGTCAG GATGCCCGTG 
(2) INFORMATION FOR SEQ ID NO: 27: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
AGTTACGAAT TCGCCACCAT GGCTCTACGG GTGCTTCTTC TG 
(2) INFORMATION FOR SEQ ID NO: 28: 

< i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNES S : S ingle 

(D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 
AGTTACGAAT TCGCCACCAT GACTCGGACT GTGCTTCTTC TG 
(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 

AGTTACGAAT TCGCCACCAT GACCTTCGGC ACTGTG 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
{ B ) TYPE : nuc leic ac id 

( C ) STRANDEDNESS : single 

(D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3D: 
TTGCTGACTG CCTGCAGTTC 20 
(2) INFORMATION FOR SEQ ID NO: 31: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

GTTCTGACGC GTAATGGCAT TGTAGACCTC GTCTTC 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTHS 36 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
ACGTATGCAG GATCCCATCA AGAGATCJGAC ATCGCT 
(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
ACTGCATGTC TCGAGGCTGA AGCCTTCTTG GGACATC 
(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ingle 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPEi DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
TATAGACTGC TGGGTAGTCC CCAC 
(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
TGAAGATTGG GGGTAAATAA CAGA 



(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3528 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1- ^3456 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

GGC TGG GCC CTG GCT TCC TGT CAT GGG TCT AAC CTG GAT GTG GAG GAA 48 
Glv Trp Ala Leu Ala Ser Cys Hie Gly Ser Asn Leu Asp Val Glu Glu 
15 10 15 

CCC ATC GTG TTC AGA GAG GAT GCA GCC AGC TTT GGA CAG ACT GTG GTG 96 
Pro He Val Phe Arg Glu Asp Ala Ala Ser Phe Gly Gin Thr Val Val 
20 25 30 

CAG TTT GGT GGA TCT CGA CTC GTG GTG GGA GCC OCT CTG GAG GCG GTG 144 
Gin Phe Gly Gly Ser Arg Leu Val Val Gly Ala Pro Leu Glu Ala Val 
35 40 45 

GCA GTC AAC CAA ACA GGA CGG TTG TAT GAC TGT GCA CCT GCC ACT GGC 192 
Ala Val Asn Gin Thr Gly Arg Leu Tyr Asp Cys Ala Pro Ala Thr Gly 
50 55 60 
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ATG TGC CAG CCC ATC GTA CTG CGC AGT CCC CTA GAG GCA GTG AAC ATG 240 
Met Cys Gin Pro lie Val Leu Arg Ser Pro Leu Glu Ala Val Asn Met 
65 70 75 80 

TCC CTG GGC CTG TCT CTG GTG ACT GCC ACC AAT AAC GCC CAG TTG CTG 288 
Ser Leu Gly Leu Ser Leu Val Thr Ala Thr Asn Asn Ala Gin Leu Leu 
85 90 95 

GCT TGT GGT CCA ACT GCA CAG AGA GCT TGT GTG AAG AAC ATG TAT GCG 336 
Ala Cys Gly Pro Thr Ala Gin Arg Ala Cys Val Lys Asn Met Tyr Ala 
100 105 110 

AAA GGT TCC TGC CTC CTT CTC GGC TCC AGC TTG CAG TTC ATC CAG GCA 384 
Lys Gly Ser Cys Leu Leu Leu Gly Ser Ser Leu Gin Phe lie Gin Ala 
115 120 125 

GTC CCT GCC TCC ATG CCA GAG TGT CCA AGA CAA GAG ATG GAC ATT GCT 432 
Val Pro Ala Ser Met Pro Glu Cys Pro Arg Gin Glu Met Asp lie Ala 
130 135 140 

TTC CTG ATT GAT GGT TCT GGC AGC ATT AAC CAA AGG GAC TTT GCC CAG 480 
Phe Leu He Asp Gly Ser Gly Ser He Asn Gin Arg Asp Phe Ala Gin 
145 150 155 160 

ATG AAG GAC TTT GTC AAA GCT TTG ATG GGA GAG TTT GCG AGC ACC AGC 528 
Met Lys Asp Phe Val Lys Ala Leu Met Gly Glu Phe Ala Ser Thr Ser 
165 170 175 

ACC TTG TTC TCC CTG ATG CAA TAC TCG AAC ATC CTG AAG ACC CAT TTT 576 
Thr Leu Phe Ser Leu Met Gin Tyr Ser Asn He Leu Lys Thr His Phe 
180 185 190 

ACC TTC ACT GAA TTC AAG AAC ATC CTG GAC CCT CAG AGC CTG GTG GAT 624 
Thr Phe Thr Glu Phe Lys Asn He Leu Asp Pro Gin Ser Leu Val Asp 
195 200 205 

CCC ATT GTC CAG CTG CAA GGC CTG ACC TAC ACA GCC ACA GGC ATC CGG 672 
Pro He Val Gin Leu Gin Gly Leu Thr Tyr Thr Ala Thr Gly He Ara 
210 215 220 

ACA GTG ATG GAA GAG CTA TTT CAT AGC AAG AAT GGG TCC CGT AAA AGT 720 
Thr Val Met Glu Glu Leu Phe His Ser Lys Asn Gly Ser Arg Lvs Ser 
225 230 .235 240 

GCC AAG AAG ATC CTC CTT GTC ATC ACA GAT GGG CAG AAA TAC AGA GAC 768 
Ala Lys Lys He Leu Leu Val He Thr Asp Gly Gin Lys Tyr Arg Asp 
245 250 255 

CCC CTG GAG TAT AGT GAT GTC ATT CCC GCC GCA GAC AAA GCT GGC ATC 816 
Pro Leu Glu Tyr Ser Asp Val He Pro Ala Ala Asp Lys Ala Gly He 
260 265 270 

ATT CGT TAT GCT ATT GGG GTG GGA GAT GCC TTC CAG GAG CCC ACT GCC 864 
He Arg Tyr Ala He Gly Val Gly Asp Ala Phe Gin Glu Pro Thr Ala 
275 280 285 

CTG AAG GAG CTG AAC ACC ATT GGC TCA GCT CCC CCA CAG GAC CAC GTG 912 
Leu Lys Glu Leu Asn Thr He Gly Ser Ala Pro Pro Gin Asp His Val 
290 295 300 
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TTC AAG GTA GGC AAC TTT GCA GCA CTT CGC AGC ATC CAG AGG CAA CTT 960 
Phe Lvs Val Glv Asn Phe Ala Ala Leu Arg Ser lie Gin Arg Gin Leu 
305 ' ' 310 315 320 

CAG GAG AAA ATC TTC GCC ATT GAG GGA ACT CAA TCA AGG TCA AGT AGT 1008 
Gin Glu Lys lie Phe Ala lie Glu Gly Thr Gin Ser Arg Ser Ser Ser 
325 330 335 

TCC TTT CAG CAC GAG ATG TCA CAA GAA GGT TTC AGT TCA GCT CTC AfcA 1056 
Ser Phe Gin His Glu Met Ser Gin Glu Gly Phe Ser Ser Ala Leu Thr 
340 345 350 

TCG GAT GGA CCC GTT CTG GGG GCC GYG GGA AGC TTC AGC TGG TCC GGA 1104 
Ser Asp Gly Pro Val Leu Gly Ala Xaa Gly Ser Phe Ser Trp Ser Gly 
355 360 365 

GGT GCC TTC TTA TAT CCC CCA AAT ACG AGA CCC ACC TTT ATC AAC ATG 1152 
Gly Ala Phe Leu Tyr Pro Pro Asn Thr Arg Pro Thr Phe lie Asn Met 
370 375 380 

TCT CAG GAG 7VAT GTG GAC ATG AGA GAC TCC TAG CTG GGT TAC TCC ACC 1200 
Ser Gin Glu Asn Val Asp Met Arg Asp Ser Tyr Leu Gly Tyr Ser Thr 
385 390 395 400 

GCA GTG GCC TTT TGG AAG GGG GTT CAC AGC CTG ATC CTG GGG GCC CCG 1248 
Ala Val Ala Phe Trp Lys Gly Val His Ser Leu lie Leu Gly Ala Pro 
405 410 415 

CGT CAC CAG CAC ACG GGG AAG GTT GTC ATC TTT ACC CAG GAA GCC AGG 1296 
Arg His Gin His Thr Gly Lys Val Val lie Phe Thr Gin Glu Ala Arg 
420 425 430 

CAT TGG AGG CCC AAG TCT GAA GTC AGA GGG ACA CAG ATC GGC TCC TAC 1344 
His Trp Arg Pro Lys Ser Glu Val Arg Gly Thr Gin He Gly Ser Tyr 
435 440 445 

TTC GGG GCC TCT CTC TGT TCT GTG GAC GTG GAT AGA GAT GGC AGC ACY 1392 
Phe Gly Ala Ser Leu Cys Ser Val Asp Val Asp Arg Asp Gly Ser Xaa 
450 455 460 

GAC CTG GTC CTG ATC GGA GCC CCC CAT TAC TAT GAG CAG ACC CGA GGG 1440 
Asp Leu Val Leu He Gly Ala Pro His Tyr Tyr Glu Gin Thr Arg Gly 
465 470 475 480 

GGG CAG GTC TCA GTG TKC CCC GTG CCC GGT GTG AGG GGC AGG TGG CAG 1488 
Gly Gin Val Ser Val Xaa Pro Val Pro Gly Val Arg Gly Arg Trp Gin 
485 490 495 

TGT GAG GCC ACC CTC CAC GGG GAG CAG GRC CAT CCT TGG GGC CGC TTT 1536 
Cys Glu Ala Thr Leu His Gly Glu Gin Xaa His Pro Trp Gly Arg Phe 
500 505 510 

GGG GTG GCT CTG ACA GTG CTG GGG GAC GTA AAC GGG GAC AAT CTG GCA 1584 
Gly Val Ala Leu Thr Val Leu Gly Asp Val Asn Gly Asp Asn Leu Ala 
515 520 525 

GAC GTG GCT ATT GGT GCC CCT GGA GAG GAG GAG AGC AGA GGT GCT GTC 1632 
Asp Val Ala lie Gly Ala Pro Gly Glu Glu Glu Ser Arg Gly Ala Val 
530 535 540 
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TAG ATA TTT CAT GGA GCC TCG AG A CTG GAG ATC ATG CCC TCA CCC AGO 1680 
Tyr He Phe His Gly Ala Ser Arg Leu Glu He Met Pro Ser Pro Ser 
545 550 555 560 

CAG CGG GTC ACT GGC TCC CAG CTC TCC CTG AGA CTG CAG TAT TTT GGG 1728 
Gin Arg Val Thr Gly Ser Gin Leu Ser Leu Arg Leu Gin Tyr Phe Gly 
565 570 575 

CAG TCA TTG AGT GGG GGT CAG GAC CTT ACA CAG GAT GGC CTG GTG GAC 1776 
Gin ser Leu Ser Gly Gly Gin Asp Leu Thr Gin Asp Gly Leu Val Asp 
580 585 590 

CTG GCC GTG GGA GCC CAG GGG CAC GTA CTG CTG CTC AGG AGT CTG CCT 1824 
Leu Ala Val Gly Ala Gin Gly His Val Leu Leu Leu Arg Ser Leu Pro 
595 600 605 

CTG CTG AAA CTC CAC CTC TCC ATA AGA TTC GCC CCC ATG GAG GTG GCA 1872 
Leu Leu Lya Val Clu Leu Ser He Arg Phe Ala Pro Met Glu Val Ala 
610 615 620 

AAG GCT CTG TAC CAC TCC TGG GAA AGG ACT CCC ACT GTC CTC GAA GCT 1920 
Lys Ala Val Tyr Cln Cyc Trp Glu Arg Thr Pro Thr Val Leu Glu Ala 
"5 630 635 640 

GGA GAG GCC ACT CTC TCT CTC ACT GTC CAC AAA GGC TCA CCT GAC CTG 1968 
Gly Glu Ala Thr Val Cys Leu Thr Val His Lys Gly Ser Pro Asp Leu 
645 650 655 

TTA GGT AAT CTC CAA GGC TCT GTC AGG TAT GAT CTG GCG TTA GAT CCG 2016 
Leu Gly Asn Val Gin Gly Ser Val Arg Tyr Asp Leu Ala Leu Asp Pro 
660 665 670 

GGC CGC CTG ATT TCT CGT GCC ATT TTT GAT GAG ACT AAG AAC TCC ACT 2064 
Gly Arg Leu He Ser Arg Ala He Phe Asp Glu Thr Lys Asn Cys Thr 
675 680 685 

TTG ACG GGA AGG AAC ACT CTG GGG CTT GGT GAT CAC TGC GAA ACA GTG 2112 
Leu Thr Gly Arg Lys Thr Leu Gly Leu Gly Asp His Cys Glu Thr Val 
690 695 700 

AAG CTG CTT TTG CCG GAC TGT GTG GAA GAT GCA GTG AGC CCT ATC ATC 2160 
Lys Leu Leu Leu Pro Asp Cys Val Glu Asp Ala Val Ser Pro He He 
705 710 715 720 



CTG CGC CTC AAC TTT TCC CTG GTG AGA GAC TCT GCT TCA CCC AGG AAC 
Leu Arg Leu Asn Phe Ser Leu Val Arg Asp Ser Ala Ser Pro Aro Asn 



725 730 735 



2208 



CTG CAT CCT GTG CTG GCT GTG GGC TCA CAA GAC CAC ATA ACT GCT TCT 2256 
Leu His Pro Val Leu Ala Val Gly Ser Gin Asp His He Thr Ala Ser 
740 745 750 

CTG CCG TTT GAG AAG AAC TGT AAG CAA GAA CTC CTG TGT GAG GGG GAC 2304 
Leu Pro Phe Glu Lys Asn Cys Lys Gin Glu Leu Leu Cys Glu Gly Asp 
755 760 765 

CTG GGC ATC AGC TTT AAC TTC TCA GGC CTG CAG GTC TTG GTG GTG GGA 2352 
Leu Gly He Ser Pho Asn Phe Ser Gly Leu Gin Val Leu Val Val Glv 
770 775 780 
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GGC TCC CCA GAG CTC ACT GTG ACA GTC ACT GTG TGG AAT GAG GGT GAG 
Glv Ser Pro Glu Leu Thr Val Thr Val Thr Val Trp Asn Glu Gly Glu 
785 790 795 800 

GAC AGO TAT GGA ACT TTA GTC AAG TTC TAC TAG CCA GCA GGG CTA TCT 
ASD Ser Tyr Gly Thr Leu Val Lys Phe Tyr Tyr Pro Ala Gly Leu Ser 
805 810 815 

TAC CGA CGG GTA ACA GGG ACT CAG CAA CCT CAT CAG TAC CCA CTA CGC 
Tyr Arq Arq Val Thr Gly Thr Gin Gin Pro Hie Gin Tyr Pro Leu Arg 
^ 820 825 830 

TTG GCC TGT GAG GCT GAG CCC GCT GCC CAG GAG GAC CTG AGG AGC AGC 
Leu Ala Cys Glu Ala Glu Pro Ala Ala Gin Glu Asp Leu Arg Ser Ser 
835 840 845 

AGC TGT AGC ATT AAT CAC CCC ATC TTC CGA GAA GGT GCA AAG ACC ACC 
Ser Cys Ser lie Asn Hie Pro lie Phe Arg Glu Gly Ala Lys Thr Thr 
850 855 860 

TTC ATG ATC ACA TTC GAT GTC TCC TAC AAG GCC TTC CTA GGA GAC AGG 
Phe Met lie Thr Phe Asp Val Ser Tyr Lys Ala Phe Leu Gly Asp Arg 
865 870 875 880 

TTG CTT CTG AGG GCC AAA GCC AGC AGT GAG AAT AAT AAG CCT GAT ACC 
Leu Leu Leu Arg Ala Lys Ala Ser Ser Glu Asn Asn Lys Pro Asp Thr 
885 890 895 

AAC AAG ACT GCC TTC CAG CTG GAG CTC CCA GTG AAG TAC ACC GTC TAT 
Asn Lys Thr Ala Phe Gin Leu Glu Leu Pro Val Lys Tyr Thr Val Tyr 
900 905 910 

ACC CTG ATC AGT AGG CAA GAA GAT TCC ACC AAC CAT GTC AAC TTT TCA 
Thr Leu lie Ser Arg Gin Glu Asp Ser Thr Asn His Val Asn Phe Ser 
915 920 925 

TCT TCC CAC GGG GGG AGA AGG CAA GAA GCC GCA CAT CGC TAT CGT GTG 
Ser Ser His Gly Gly Arg Arg Gin Glu Ala Ala His Arg Tyr Arg Val 
930 935 940 

AAT AAC CTG AGT CCA CTG AAG CTG GCC GTC AGA GTT AAC TTC TGG GTC 
Asn Asn Leu Ser Pro Leu Lys Leu Ala Val Arg Val Asn Phe Trp Val 
945 950 955 960 

CCT GTC CTT CTG AAC GGT GTG GCT GTG TGG GAC GTG ACT CTG AGC AGC 
Pro Val Leu Leu Asn Gly Val Ala Val Trp Asp Val Thr Leu Ser Ser 
965 970 975 



2400 



2448 



2496 



2544 



2592 



2640 



2688 



2736 



2784 



2832 



2880 



2928 



CCA GCA CAG GGT GTC TCC TGC GTG TCC CAG ATG AAA CCT CCT CAG AAT 2976 
Pro Ala Gin Gly Val Ser Cys Val Ser Gin Met Lys Pro Pro Gin Asn 
980 985 990 

CCC GAC TTT CTG ACC CAG ATT CAG AGA CGT TCT GTG CTG GAC TGC TCC 3024 
Pro Asp Phe Leu Thr Gin lie Gin Arg Arg Ser Val Leu Asp Cys Ser 
995 1000 1005 

ATT GCT GAC TGC CTG CAC TCC CGC TGT GAC ATC CCC TCC TTG GAC ATC 3072 
lie Ala Asp Cys Leu His Ser Arg Cys Asp lie Pro Ser Leu Asp He 
1010 1015 1020 



BNSDOCID: <WO 9517412A1J_> 



wo 95/17412 



PCTAJS94/14832 



- 112 - 

CAG GAT GAA CTT GAC TTC ATT CTG AGG GGC AAC CTC AGC TTC GGC TGG 3120 
Gin Asp Glu Leu Asp Phe lie Leu Arg Gly Aen Leu Ser Phe Glv Trp 
1025 1030 1035 1040 

GTC AGT CAG ACA TTG CAG GAA AAG GTG TTG CTT GTG AGT GAG GCT GAA 3168 
Val Ser Gin Thr Leu Gin Glu Lye Val Leu Leu Val Ser Glu Ala Glu 
1045 1050 1055 

ATC ACT TTC GAC ACA TCT GTG TAC TCC CAG CTG CCA GGA CAG GAG GCA 3216 
lie Thr Phe Asp Thr Ser Val Tyr ser Gin Leu Pro Gly Gin Glu Ala 
1060 1065 1070 

TTT CTG AGA GCC CAG GTG GAG ACA ACG TTA GAA GAA TAC GTG GTC TAT 3264 
Phe Leu Arg Ala Gin Val Glu Thr Thr Leu Glu Glu Tyr Val Val Tvr 
1075 1080 1085 

GAG CCC ATC TTC CTC GTG GCG GGC AGC TCG GTG GGA GGT CTG CTG TTA 3312 
Glu Pro lie Phe Leu Val Ala Gly Ser Ser Val Gly Gly Leu Leu Leu 
1090 1095 1100 

CTG GCT CTC ATC ACA GTG GTA CTG TAC AAG CTT GGC TYC TYC AAA CGT 3360 
Leu Ala Leu lie Thr Val Val Leu Tyr Lys Leu Gly Xaa Xaa Lys Aro 
1105 1110 1115 1120 

CAG TAC AAA GAA ATG CTG GAC GGC AAG GCT GCA GAT CCT GTC ACA GCC 3408 
Gin Tyr Lys Glu Met Leu Asp Gly Lys Ala Ala Asp Pro Val Thr Ala 
1125 1130 1135 

GGC CAG GCA GAT TTC GGC TGT GAG ACT CCT CCA TAT CTC GTG AGC TAGGAATCCA 3463 
Gly Gin Ala Asp Phe Gly Cys Glu Thr Pro Pro Tyr Leu Val ser 
1140 1145 1150 

CTCTCCTGCC TATCTCTGNA ATGAAGATTG GTCCTGCCTA TGAGTCTACT GGCATGGGAA 3523 
^^'^^^ 3528 



(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1151 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

Gly Trp Ala Leu Ala Ser Cys His Gly Ser Asn Leu Asp Val Glu Glu 
1 5 10 15 

Pro lie Val Phe Arg Glu Asp Ala Ala Ser Phe Gly Gin Thr Val Val 
20 25 30 

Gin Phe Gly Gly Ser Arg Leu Val Val Gly Ala Pro Leu Glu Ala Val 
35 40 45 

Ala Val Asn Gin Thr Gly Arg Leu Tyr Asp Cys Ala Pro Ala Thr Glv 
50 55 60 
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Met Cys Gin Pro lie Val Leu Arg Ser Pro Leu Glu Ala Val Asn Met 
65 70 75 80 

Ser Leu Gly Leu Ser Leu Val Thr Ala Thr Asn Asn Ala Gin Leu Leu 
85 90 95 

Ala Cys Gly Pro Thr Ala Gin Arg Ala Cys Val Lys Asn Met Tyr Ala 
100 105 110 

Lys Gly Ser Cys Leu Leu Leu Gly Ser Ser Leu Gin Phe lie Gin Ala 
115 120 125 

Val Pro Ala Ser Met Pro Glu Cys Pro Arg Gin Glu Met Asp He Ala 
130 135 140 

Phe Leu He Asp Gly Ser Gly Ser He Asn Gin Arg Asp Phe Ala Gin 
145 150 155 160 

Met Lys Asp Phe Val Lys Ala Leu Met Gly Glu Phe Ala Ser Thr Ser 
165 170 175 

Thr Leu Phe Ser Leu Met Gin Tyr Ser Asn He Leu Lys Thr His Phe 
180 185 190 

Thr Phe Thr Glu Phe Lys Asn He Leu Asp Pro Gin Ser Leu Val Asp 
195 200 205 

Pro He Val Gin Leu Gin Gly Leu Thr Tyr Thr Ala Thr Gly He Arg 
210 215 220 

Thr Val Met Glu Glu Leu Phe His Ser Lys Asn Gly Ser Arg Lys Ser 
225 230 235 240 

Ala Lys Lys He Leu Leu Val He Thr Asp Gly Gin Lys Tyr Arg Asp 
245 250 255 

Pro Leu Glu Tyr Ser Asp Val He Pro Ala Ala Asp Lys Ala Gly He 
260 265 270 

He Arg Tyr Ala He Gly Val Gly Asp Ala Phe Gin Glu Pro Thr Ala 
275 280 285 

Leu Lys Glu Leu Asn Thr He Gly Ser Ala Pro Pro Gin Asp His Val 
290 295 300 

Phe Lys Val Gly Asn Phe Ala Ala Leu Arg Ser He Gin Arg Gin Leu 
305 310 315 320 

Gin Glu Lys He Phe Ala He Glu Gly Thr Gin Ser Arg Ser Ser Ser 
325 330 335 

Ser Phe Gin His Glu Met Ser Gin Glu Gly Phe Ser Ser Ala Leu Thr 
340 345 350 

Ser Asp Gly Pro Val Leu Gly Ala Xaa Gly Ser Phe Ser Trp Ser Gly 
355 360 365 

Gly Ala Phe Leu Tyr Pro Pro Asn Thr Arg Pro Thr Phe He Asn Met 
370 375 380 
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Ser Gin Glu Asn Val Asp Met Arg Asp Ser Tyr Leu Gly Tyr Ser Thr 
385 390 395 400 

Ala Val Ala Phe Trp Lys Gly Val His Ser Leu lie Leu Gly Ala Pro 
405 410 415 

Arg His Gin His Thr Gly Lys Val Val He Phe Thr Gin Glu Ala Ara 
420 425 430 

His Trp Arg Pro Lys Ser Glu Val Arg Gly Thr Gin He Gly Ser Tvr 
435 440 445 ^ 

Phe Gly Ala Ser Leu Cys Ser Val Asp Val Asp Arg Asp Gly Ser Xaa 
450 455 460 

Afip Leu Val Leu He Gly Ala Pro His Tyr Tyr Glu Gin Thr Arc Glv 
465 470 475 ^ 480 

Gly Gin Val Ser Val Xaa Pro Val Pro Gly Val Arg Gly Arg Trp Gin 
485 490 495 

Cys Glu Ala Thr Leu His Gly Glu Gin Xaa His Pro Trp Gly Ara Phe 
500 505 510 

Gly Val Ala Leu Thr Val Leu Gly Asp Val Asn Gly Asp Asn Leu Ala 
515 520 * 525 

Asp Val Ala He Gly Ala Pro Gly Glu Glu Glu Ser Arg Gly Ala Val 
530 535 540 

Tyr He Phe His Gly Ala Ser Arg Leu Glu He Met Pro Ser Pro Ser 

550 555 ' 560 

Gin Arg Val Thr Gly Ser Gin Leu Ser Leu Arg Leu Gin Tyr Phe Glv 
565 570 575 

Gin Ser Leu Ser Gly Gly Gin Asp Leu Thr Gin Asp Gly Leu Val Aso 
580 585 590 

Leu Ala Val Gly Ala Gin Gly His Val Leu Leu Leu Arc Ser Leu Pro 
595 600 605 

Leu Leu Lys Val Glu Leu Ser He Arg Phe Ala Pro Met Glu Val Ala 

615 620 

Lys Ala Val Tyr Gin Cys Trp Glu Arg Thr Pro Thr Val Leu Glu Ala 

"0 635 640 

Gly Glu Ala Thr Val Cys Leu Thr Val His Lys Gly Ser Pro Asp Leu 
645 650 655 

Leu Gly Asn Val Gin Gly Ser Val Arg Tyr Asp Leu Ala Leu Asp Pro 
660 665 670 

Gly Arg Leu He Ser Arg Ala He Phe Asp Glu Thr Lys Asn Cys Thr 
675 680 685 

Leu Thr Gly Arg Lys Thr Leu Gly Leu Gly Asp His Cys Glu Thr Val 
690 695 700 
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Lvs Leu Leu Leu Pro Asp Cys Val Glu Asp Ala Val Ser Pro lie lie 
705 710 715 720 

Leu Arg Leu Asn Phe Ser Leu Val Arg Asp Ser Ala Ser Pro Arg Asn 
725 730 735 

Leu His Pro Val Leu Ala Val Gly Ser Gin Asp His lie Thr Ala Ser 
740 745 750 

Leu Pro Phe Glu Lys Asn Cys Lys Gin Glu Leu Leu Cys Glu Gly Asp 
755 760 765 

Leu Gly lie Ser Phe Asn Phe Ser Gly Leu Gin Val Leu Val Val Gly 
770 775 780 

Gly Ser Pro Glu Leu Thr Val Thr Val Thr Val Trp Asn Glu Gly Glu 
785 790 795 800 

Asp Ser Tyr Gly Thr Leu Val Lys Phe Tyr Tyr Pro Ala Gly Leu Ser 
805 810 815 

Tvr Aro Arg Val Thr Gly Thr Gin Gin Pro His Gin Tyr Pro Leu Arg 
^ 820 825 830 

Leu Ala Cys Glu Ala Glu Pro Ala Ala Gin Glu Asp Leu Arg Ser Ser 
835 840 845 

Ser Cys Ser lie Asn His Pro He Phe Arg Glu Gly Ala Lys Thr Thr 
850 855 860 

Phe Met lie Thr Phe Asp Val Ser Tyr Lys Ala Phe Leu Gly Asp Arg 
865 870 875 880 

Leu Leu Leu Arg Ala Lys Ala Ser Ser Glu Asn Asn Lys Pro Asp Thr 
885 890 895 

Asn Lys Thr Ala Phe Gin Leu Glu Leu Pro Val Lys Tyr Thr Val Tyr 
900 905 910 

Thr Leu He Ser Arg Gin Glu Asp Ser Thr Asn His Val Asn Phe Ser 
915 920 925 

Ser Ser His Gly Gly Arg Arg Gin Glu Ala Ala His Arg Tyr Arg Val 
930 935 940 

Asn Asn Leu Ser Pro Leu Lys Leu Ala Val Arg Val Asn Phe Trp Val 
945 950 955 960 

Pro Val Leu Leu Asn Gly Val Ala Val Trp Asp Val Thr Leu Ser Ser 
965 970 975 

Pro Ala Gin Gly Val Ser cys Val Ser Gin Met Lys Pro Pro Gin Asn 
980 985 990 

Pro Asp Phe Leu Thr Gin He Gin Arg Arg Ser Val Leu Asp Cys Ser 
995 1000 1005 

He Ala Asp Cys Leu His Ser Arg Cys Asp He Pro Ser Leu Asp He 
1010 1015 1020 
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Gln Asp Glu Leu Asp Phe lie Leu Arg Gly Asn Leu Ser Phe Gly Trp 
1025 1030 1035 1040 

Val Ser Gin Thr Leu Gin Glu Lys Val Leu Leu Val Ser Glu Ala Glu 
1045 1050 1055 

lie Thr Phe Asp Thr Ser Val Tyr Ser Gin Leu Pro Gly Gin Glu Ala 
1060 1065 1070 

Phe Leu Arg Ala Gin Val Glu Thr Thr Leu Glu Glu Tyr Val Val Tyr 
1075 1080 1085 

Glu Pro lie Phe Leu Val Ala Gly Ser Ser Val Gly Gly Leu Leu Leu 
1090 1095 1100 

Leu Ala Leu lie Thr Val Val Leu Tyr Lys Leu Gly Xaa Xaa Lys Arg 
ii05 1110 1115 1120 

Gin Tyr Lys Glu Met Leu Asp Gly Lys Ala Ala Asp Pro Val Thr Ala 
1125 1130 1135 

Gly Gin Ala Asp Phe Gly Cys Glu Thr Pro Pro Tyr Leu Val Ser 
1140 1145 1150 

<2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
GTCCAAGCTG TCATGGGCCA G 
(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
GTCCAGCAGA CTGAAGAGCA CGG 
(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 
TGTAAAACGA CGGCCAGT 18 
<2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 
GGAAACAGCT ATGACCATG 19 
(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 
GGACATGTTC ACTGCCTCTA GG 22 
(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
GGCGGACAGT CAGACGACTG TCCTG 25 
(2) INFORMATION FOR SEQ ID NO: 44: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOCy: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:44: 
CTGGTTCGGC CCACCTCTGA AGGTTCCAGA ATCGATAG 
(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3519 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 52.. 3519 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 45: 

GCTTTCTGAA GGTTCCAGAA TCGATAGTGA ATTCGTGGGC ACTGCTCAGA T ATG GTC 57 

Met Val 
1 

CGT GGA GTT GTG ATC CTC CTG TGT GGC TGG GCC CTG GCT TCC TGT CAT IDS 
Arg Gly Val Val He Leu Leu Cys Gly Trp Ala Leu Ala Ser Cys His 
5 10 15 

GGG TCT AAC CTG GAT GTG GAG AAG CCC GTC GTG TTC AAA GAG GAT GCA 153 
Gly Ser Asn Leu Asp Val Glu Lys Pro Val Val Phe Lys Glu Asp Ala 

25 30 

GCC AGC TTC GGA CAG ACT GTG GTG CAG TTT GGT GGA TCT CGA CTC GTG 201 
Ala Ser Phe Gly Gin Thr Val Val Gin Phe Gly Gly Ser Arg Leu Val 

vl? r?^ t^^ ^'^^ CAA ACA GGA CAG TCG 249 

Val Gly Ala Pro Leu Glu Ala Val Ala Val Asn Gin Thr Gly Gin Ser 
55 60 65 

TCT GAC TGT CCG CCT GCC ACT GGC GTG TGC CAG CCC ATC TTA CTG CAC 297 
Ser Asp Cya Pro Pro Ala Thr Gly Val Cys Gin Pro He Leu Leu His 
70 75 QQ 

ATT CCC CTA GAG GCA GTG AAC ATG TCC CTG GGC CTG TCT CTG GTG GCT 345 
He Pro Leu Glu Ala Val Asn Met Ser Leu Gly Leu Ser Leu Val Ala 
85 90 95 
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GAC ACC AAT AAC TCC CAG TTG CTG GCT TGT GGT CCA ACT GCA CAG AGA 393 
Afio Thr Asn Asn Ser Gin Leu Leu Ala Cys Gly Pro Thr Ala Gin Arg 
* 100 105 iiu 

GCT TGT GCA AAG AAC ATG TAT GCA AAA GGT TCC TGC CTC CTT CTG GGC 441 
Ala cys Ala Lys Asn Met Tyr Ala Lys Gly Ser Cys Leu Leu Leu Gly 
115 120 125 130 

TCC AGC TTG CAG TTC ATC CAG GCA ATC CCT GCT ACC ATG CCA GAG TGT 489 
ser ser Leu Gin Phe lie Gin Ala lie Pro Ala Thr Met Pro Glu Cys 
135 

CCA GGA CAA GAG ATG GAC ATT GCT TTC CTG ATT GAT GGC TCC GGC AGC 537 
Pro Gly Gin Glu Met Asp lie Ala Phe Leu lie Asp Gly Ser Gly Ser 
150 155 160 

ATT GAT CAA AGT GAC TTT ACC CAG ATG AAG GAC TTC GTC AAA GCT TTG 585 
He Asp Gin Ser Asp Phe Thr Gin Met Lys Asp Phe Val Lys Ala Leu 
165 170 175 

ATG GGC CAG TTG GCG AGC ACC AGC ACC TCG TTC TCC CTG ATG CAA TAC 633 
Met Glv Gin Leu Ala Ser Thr Ser Thr Ser Phe Ser Leu Met Gin Tyr 
180 185 190 

TCA AAC ATC CTG AAG ACT CAT TTT ACC TTC ACG GAA TTC AAG AGC AGC 681 
ser Asn He Leu Lys Thr His Phe Thr Phe Thr Glu Phe Lys Ser Ser 
195 200 205 210 

CTG AGC CCT CAG AGC CTG GTG GAT GCC ATC GTC CAG CTC CAA GGC CTG 729 
Leu ser Pro Gin Ser Leu Val Asp Ala He Val Gin Leu Gin Gly Leu 
215 220 225 

ACG TAC ACA GCC TCG GGC ATC CAG AAA GTG GTG AAA GAG CTA TTT CAT 777 
Thr Tyr Thr Ala Ser Gly He Gin Lys Val Val Lys Glu Leu Phe His 
230 235 240 

AGC AAG AAT GGG GCC CGA AAA AGT GCC AAG AAG ATA CTA ATT GTC ATC 825 
ser Lys Asn Gly Ala Arg Lys Ser Ala Lys Lys He Leu He Val He 
245 250 255 

ACA GAT GGG CAG AAA TTC AGA GAC CCC CTG GAG TAT AGA CAT GTC ATC 873 
Thr Asp Gly Gin Lys Phe Arg Asp Pro Leu Glu Tyr Arg His Val He 
260 265 270 

CCT GAA GCA GAG AAA GCT GGG ATC ATT CGC TAT GCT ATA GGG GTG GGA 921 
Pro Glu Ala Glu Lys Ala Gly He He Arg Tyr Ala He Gly Val Gly 
275 280 285 290 

GAT GCC TTC CGG GAA CCC ACT GCC CTA CAG GAG CTG AAC ACC ATT GGC 969 
Asp Ala Phe Arg Glu Pro Thr Ala Leu Gin Glu Leu Asn Thr He Gly 
295 300 305 

TCA GCT CCC TCG CAG GAC CAC GTG TTC AAG GTG GGC AAT TTT GTA GCA 1017 
Ser Ala Pro Ser Gin Asp His Val Phe Lys Val Gly Asn Phe Val Ala 
310 315 320 

CTT CGC AGC ATC CAG CGG CAA ATT CAG GAG AAA ATC TTT GCC ATT GAA 1065 
Leu Arg Ser He Gin Arg Gin He Gin Glu Lys He Phe Ala He Glu 
325 330 335 
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GGA ACC GAA TCA AGO TCA AGT AGT TCC TTT CAG CAC GAG ATG TCA CAA 1113 
Gly Thr Glu Ser Arg Ser Ser Ser Ser Phe Gin His Glu Met Ser Gin 
340 345 350 

GAA GGT TTC AGO TCA GCT CTC TCA ATG GAT GGA CCA GTT CTG GGG GCT 1161 
Glu Gly Phe Ser Ser Ala Leu Ser Met Asp Gly Pro Val Leu Gly Ala 
355 ^ 360 365 370 

GTG GGA GGC TTC AGC TGG TCT GGA GGT GCC TTC TTG TAC CCC TCA AAT 1209 
Val Gly Gly Phe ser Trp Ser Gly Gly Ala Phe Leu Tyr Pro Ser Aon 
375 380 385 

ATG AGA TCC ACC TTC ATC AAC ATG TCT CAG GAG AAC GAG GAT ATG AGG 1257 
Met Arg Ser Thr Phe lie Asn Met Ser Gin Glu Aan Glu Asp Met Arg 
390 395 400 

GAC GCT TAC CTG GGT TAC TCC ACC GCA CTG GCC TTT TGG AAG GGG GTC 1305 
Asp Ala Tyr Leu Gly Tyr Ser Thr Ala Leu Ala Phe Trp Lys Gly Val 
405 410 415 

CAC AGC CTG ATC CTG GGG GCC CCT CGC CAC CAG CAC ACG GGG AAG GTT 1353 
His Ser Leu lie Leu Gly Ala Pro Arg His Gin His Thr Gly Lys Val 
.420 425 430 

GTC ATC TTT ACC CAG GAA TCC AGG CAC TGG AGG CCC AAG TCT GAA GTC 1401 
Val lie Phe Thr Gin Glu Ser Arg His Trp Arg Pro Lys Ser Glu Val 
435 440 445 450 

AGA GGG ACA CAG ATC GGC TCC TAC TTT GGG GCA TCT CTC TGT TCT GTG 1449 
Arg Gly Thr Gin lie Gly Ser Tyr Phe Gly Ala Ser Leu Cys Ser Val 
455 460 465 

GAC ATG GAT AGA GAT GGC AGC ACT GAC CTG GTC CTG ATT GGA GTC CCC 1497 
Asp Met Asp Arg Asp Gly Ser Thr Asp Leu Val Leu He Gly Val Pro 
470 475 480 

CAT TAC TAT GAG CAC ACC CGA GGG GGG CAG GTG TCG GTG TGC CCC ATG 1545 
His Tyr Tyr Glu His Thr Arg Gly Gly Gin Val Ser Val Cys Pro Met 
485 490 495 

CCT GGT GTG AGG AGC AGG TGG CAT TGT GGG ACC ACC CTC CAT GGG GAG 1593 
Pro Gly Val Arg Ser Arg Trp His Cys Gly Thr Thr Leu His Gly Glu 
5O0 505 510 

CAG GGC CAT CCT TGG GGC CGC TTT GGG GCG GCT CTG ACA GTG CTA GGG 1641 
Gin Gly His Pro Trp Gly Arg Phe Gly Ala Ala Leu Thr Val Leu Gly 
515 520 525 530 

GAC GTG AAT GGG GAC AGT CTG GCG GAT GTG GCT ATT GGT GCA CCC GGA 1689 
Asp Val Asn Gly Asp Ser Leu Ala Asp Val Ala He Gly Ala Pro Gly 
535 540 545 

GAG GAG GAG AAC AGA GGT GCT GTC TAC ATA TTT CAT GGA GCC TCG AGA 1737 
Glu Glu Glu Asn Arg Gly Ala Val Tyr He Phe His Gly Ala Ser Arg 
550 555 560 

CAG GAC ATC GCT CCC TCG CCT AGC CAG CGG GTC ACT GGC TCC CAG CTC 1785 
Gin Asp He Ala Pro Ser Pro Ser Gin Arg Val Thr Gly Ser Gin Leu 
565 570 575 
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TTC CTG AGG CTC CAA TAT TTT GGG CAG TCA TTA AGT GGG GGT GAG GAG 1833 
Phe Leu Arg Leu Gin Tyr Phe Gly Gin Ser Leu Ser Gly Gly Gin Asp 
580 585 590 

CTT ACA CAG GAT GGC CTG GTG GAC CTG GCC GTG GGA GCC CAG GGG CAC 1881 
Leu Thr Gin Asp Gly Leu Val Asp Leu Ala Val Gly Ala Gin Gly His 
595 600 605 610 

GTG CTG CTG CTT AGG AGT CTG CCT TTG CTG AAA GTG GGG ATC TCC ATT 1929 
Val Leu Leu Leu Arg Ser Leu Pro Leu Leu Lys Val Gly lie Ser lie 
615 620 625 

AGA TTT GCC CCC TCA GAG GTG GCA AAG ACT GTG TAC CAG TGC TGG GGA 1977 
Aro Phe Ala Pro Ser Glu Val Ala Lys Thr Val Tyr Gin Cys Trp Gly 
^ 630 635 640 

AGG ACT CCC ACT CTC CTC GAA GCT GGA GAG GCC ACC GTC TGT CTC ACT 2025 
Arg Thr Pro Thr Val Leu Glu Ala Gly Glu Ala Thr Val Cys Leu Thr 
645 650 655 

GTC CGC AAA CCT TCA CCT GAC CTG TTA GGT GAT GTC CAA AGC TCT GTC 2073 
Val Arc Lys Gly Ser Pro Asp Leu Leu Gly Asp Val Gin Ser Ser Val 
660 665 670 

AGG TAT GAT CTC GCG TTC GAT CCG GGC CGT CTG ATT TCT CGT GCC ATT 2121 
Arg Tyr Asp Leu Ala Leu Asp Pro Gly Arg Leu lie Ser Arg Ala lie 
675 680 685 690 

TTT GAT GAG ACC AAG AAC TGC ACT TTG ACC CGA AGG AAG ACT CTG GGG 2169 
Phe Asp Glu Thr Lys Asn Cys Thr Leu Thr Arg Arg Lys Thr Leu Gly 
695 700 705 

CTT GGT GAT CAC TGC GAA ACA ATG AAG CTG CTT TTG CCA GAC TGT GTG 2217 
Leu Gly Asp His Cys Glu Thr Met Lys Leu Leu Leu Pro Asp Cys Val 
710 715 720 

GAG GAT GCA GTC ACC CCT ATC ATC CTG CGC CTT AAC TTA TCC CTG CCA 2265 
Glu Asp Ala Val Thr Pro He He Leu Arg Leu Asn Leu Ser Leu Ala 
725 730 735 

GGG GAC TCT GCT CCA TCC AGG AAC CTT CGT CCT GTG CTG GCT GTG GGC 2313 
Gly Asp Ser Ala Pro Ser Arg Asn Leu Arg Pro Val Leu Ala Val Gly 
740 745 750 

TCA CAA GAC CAT GTA ACA GCT TCT TTC CCG TTT GAG AAG AAC TGT GAG 2361 
Ser Gin Asp His Val Thr Ala Ser Phe Pro Phe Glu Lys Asn Cys Glu 
755 760 765 770 

GGG AAC CTG GGC GTC AGC TTC AAC TTC TCA GGC CTG CAG GTC TTG GAG 2409 
Gly Asn Leu Gly Val Ser Phe Asn Phe Ser Gly Leu Gin Val Leu Glu 
775 780 785 

GTA GGA AGC TCC CCA GAG CTC ACT GTG ACA GTA ACA GTT TGG AAT GAG 2457 
Val Gly Ser Ser Pro Glu Leu Thr Val Thr Val Thr Val Trp Asn Glu 
790 795 800 

GGT GAG GAC AGC TAT GGA ACC TTA ATC AAG TTC TAC TAC CCA GCA GAG 2505 
Gly Glu Asp Ser Tyr Gly Thr Leu He Lys Phe Tyr Tyr Pro . Ala Glu 
805 810 815 
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CTA TCT TAC CGA CGG GTG ACA AGA GCC CAG CAA CCT CAT CCG TAC CCA 2553 
Leu Ser Tyr Arg Arg Val Thr Arg Ala Gin Gin Pro His Pro Tyr Pro 
820 825 830 

CTA CGC CTG GCA TGT GAG GCT GAG CCC ACQ GGC CAG GAG AGC CTG AGG 2601 
Leu Arg Leu Ala Cys Glu Ala Glu Pro Thr Gly Gin Glu Ser Leu Ara 
835 840 845 850 

AGC AGC AGC TGT AGC ATC AAT CAC CCC ATC TTC CGA GAA GGT GCC AAG 2649 
Ser Ser Ser Cys Ser lie Asn His Pro lie Phe Arg Glu Gly Ala Lys 
855 860 865 

GCC ACC TTC ATG ATC ACA TTT GAT GTC TCC TAC AAG GCC TTC CTG GGA 2697 
Ala Thr Phe Met lie Thr Phe Asp Val Ser Tyr Lys Ala Phe Leu Gly 
870 875 " 880 

GAC AGG TTG CTT CTG AGG GCC AGC GCA AGC AGT GAG AAT AAT AAG CCT 2745 
Asp Arg Leu Leu Leu Arg Ala Ser Ala Ser Ser Glu Asn Asn Lvs Pro 
885 890 895 

GAA ACC AGC AAG ACT GCC TTC CAG CTG GAG CTT CCG GTG AAG TAC ACG 2793 
Glu Thr Ser Lys Thr Ala Phe Gin Leu Glu Leu Pro Val Lys Tvr Thr 
900 905 910 

GTC TAT ACC GTG ATC AGT AGG CAG GAA GAT TCT ACC AAG CAT TTC AAC 2841 
Val Tyr Thr Val lie Ser Arg Gin Glu Asp Ser Thr Lys His Phe Asn 

920 925 930 

TTC TCA TCT TCC CAC GGG GAG AGA CAG AAA GAG GCC GAA CAT CGA TAT 2889 
Phe ser Ser Ser His Gly Glu Arg Gin Lys Glu Ala Glu His Ara Tvr 
935 940 945 ^ 

CGT GTG AAT AAC CTG AGT CCA TTG ACG CTG GCC ATC AGC GTT AAC TTC 2937 
Arg Val Asn Asn Leu Ser Pro Leu Thr Leu Ala lie Ser Val Asn Phe 
950 955 960 

TGG GTC CCC ATC CTT CTG AAT GGT GTG GCC GTG TGG GAT GTG ACT CTG 2985 
Trp Val Pro He Leu Leu Asn Gly Val Ala Val Trp Asp Val Thr Leu 
965 970 975 

AGG AGC CCA GCA CAG GGT GTC TCC TGT GTG TCA CAG AGG GAA CCT CCT 3033 
ffS ^^"^ Val Ser Gin Arg Glu Pro Pro 

980 985 990 

CAA CAT TCC GAC CTT CTG ACC CAG ATC CAA GGA CGC TCT GTG CTG GAC 3081 
Gin Hxs Ser Asp Leu Leu Thr Gin He Gin Gly Arg Ser Val Leu Asp 

1000 1005 1010 

TGC GCC ATC GCC GAC TGC CTG CAC CTC CGC TGT GAC ATC CCC TCC TTG 3129 
Cys Ala He Ala Asp Cys Leu His Leu Arg Cys Asp He Pro Ser Leu 
1015 1020 1025 

GGC ACC CTG GAT GAG CTT GAC TTC ATT CTG AAG GGC AAC CTC AGC TTC 3177 
Gly Thr Leu Asp Glu Leu Asp Phe He Leu Lys Gly Asn Leu Ser Phe 
1030 1035 1040 

GGC TGG ATC AGT CAG ACA TTG CAG AAA AAG GTG TTG CTC CTG AGT GAG 3225 
Gly Trp He Ser Gin Thr Leu Gin Lys Lye Val Leu Leu Leu Ser Glu 
1045 1050 1055 
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GCT GAA ATC ACA TTC AAC ACA TCT GTG TAT TCC CAG CTG CCG GGA CAG 3273 
Ala Glu He Thr Phe Asn Thr Ser Val Tyr Ser Gin Leu Pro Gly Gin 
1060 1065 lOvo 

GAG GCA TTT CTG AGA GCC CAG GTG TCA ACG ATG CTA GAA GAA TAG GTG 3321 
Glu Ala Phe Leu Arg Ala Gin Val Ser Thr Met Leu Glu Glu Tyr Val 
1075 1080 1085 1090 

GTC TAT GAG CCC GTC TTC CTC ATG GTG TTC AGC TCA GTG GGA GGT CTG 3369 
Val Tyr Glu Pro Val Phe Leu Met Val Phe Ser Ser Val Gly Gly Leu 
1095 1100 1105 

CTG TTA CTG GCT CTC ATC ACT GTG GCG CTG TAG AA6 CTT GGC TTC TTC 3417 
Leu Leu Leu Ala Leu He Thr Val Ala Leu Tyr Lys Leu Gly Phe Phe 
1110 1115 1120 

AAA CGT CAG TAT AAA GAG ATG CTG GAT CTA CCA TCT GCA GAT CCT GAC 3465 
Lys Arq Gin Tyr Lys Glu Met Leu Asp Leu Pro Ser Ala Asp Pro Asp 
1125 1130 1135 

CCA GCC GGC CAG GCA GAT TCC AAC CAT GAG ACT CCT CCA CAT CTC ACG 3513 
Pro Ala Gly Gin Ala Asp Ser Asn His Glu Thr Pro Pro His Leu Thr 
1140 1145 1150 

TCC TAG 3519 

Ser 

1155 

(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1155 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

Met Val Arg Gly Val Val lie Leu Leu Cys Gly Trp Ala Leu Ala Ser 
15 10 15 

Cys His Gly Ser Asn Leu Asp Val Glu Lys Pro Val Val Phe Lys Glu 
20 25 30 

Asp Ala Ala Ser Phe Gly Gin Thr Val Val Gin Phe Gly Gly Ser Arg 
35 40 45 

Leu Val Val Gly Ala Pro Leu Glu Ala Val Ala Val Asn Gin Thr Gly 
50 55 60 

Gin Ser Ser Asp Cys Pro Pro Ala Thr Gly Val Cys Gin Pro lie Leu 
65 70 75 80 

Leu His lie Pro Leu Glu Ala Val Asn Met Ser Leu Gly Leu Ser Leu 
85 90 95 

Val Ala Asp Thr Asn Asn Ser Gin Leu Leu Ala Cys Gly Pro Thr Ala 
100 105 110 
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Gin Arg Ala Cya Ala Lys Asn Met Tyr Ala Lys Gly Ser Cys Leu Leu 
115 120 125 

Leu Gly Ser Ser Leu Gin Phe He Gin Ala He Pro Ala Thr Met Pro 
130 135 140 

Glu Cys Pro Gly Gin Glu Met Asp He Ala Phe Leu He Asp Gly Ser 
145 150 155 160 

Gly Ser He Asp Gin Ser Asp Phe Thr Gin Met Lys Asp Phe Val Lys 
165 170 175 

Ala Leu Met Gly Gin Leu Ala Ser Thr Ser Thr Ser Phe Ser Leu Met 
180 185 190 

Gin Tyr Ser Aen He Leu Lys Thr His Phe Thr Phe Thr Glu Phe Lys 
195 200 205 

Ser Ser Leu Ser Pro Cln Ser Leu Val Asp Ala He Val Gin Leu Gin 
210 215 220 

Gly Leu Thr Tyr Thr Ala Ser Gly He Gin Lys Val Val Lys Glu Leu 
225 230 235 240 

Phe His Ser Lys Asn Gly Ala Arg Lys Ser Ala Lys Lys He Leu He 
245 250 255 

Val He Thr Asp Gly Cln Lye Phe Arg Asp Pro Leu Glu Tyr Arg His 
260 265 270 

Val He Pro Glu Ala Glu Lys Ala Gly He He Arg Tyr Ala He Glv 
275 280 285 

Val Gly Asp Ala Phe Arg Glu Pro Thr Ala Leu Gin Glu Leu Asn Thr 
290 295 300 

He Gly Ser Ala Pro Ser Gin Asp His Val Phe Lys Val Gly Asn Phe 
305 310 315 320 

Val Ala Leu Arg Ser He Gin Arg Gin He Gin Glu Lys He Phe Ala 
325 330 335 

He Glu Gly Thr Glu Ser Arg Ser Ser Ser Ser Phe Gin His Glu Met 
340 345 350 

Ser Gin Glu Gly Phe Ser Ser Ala Leu Ser Met Asp Gly Pro Val Leu 
355 360 365 

Gly Ala Val Gly Gly Phe Ser Trp Ser Gly Gly Ala Phe Leu Tyr Pro 
370 375 380 

Ser Asn Met Arg Ser Thr Phe He Asn Met Ser Gin Glu Asn Glu Asp 

390 395 400 

Met Arg Asp Ala Tyr Leu Gly Tyr Ser Thr Ala Leu Ala Phe Trp Lys 
405 410 415 

Gly Val His Ser Leu He Leu Gly Ala Pro Arg His Gin His Thr Gly 
420 425 430 



'NSDOCID: <WO 9517412A1J_> 



wo 95/17412 



PCT/US94/14832 



- 125 - 

Lye Val Val lie Phe Thr Gin Glu Ser Arg Hie Trp Arg Pro Lys Ser 
435 440 445 

Glu Val Arg Gly Thr Gin lie Gly Ser Tyr Phe Gly Ala Ser Leu Cys 
450 455 460 

Ser Val Asp Met Asp Arg Asp Gly Ser Thr Asp Leu Val Leu lie Gly 
465 470 475 480 

val Pro His Tyr Tyr Glu His Thr Arg Gly Gly Gin Val Ser Val Cys 
485 490 495 

Pro Met Pro Gly Val Arg Ser Arg Trp His Cys Gly Thr Thr Leu His 
500 505 510 

Gly Glu Gin Gly His Pro Trp Gly Arg Phe Gly Ala Ala Leu Thr Val 
515 520 525 

Leu Gly Asp Val Asn Gly Asp Ser Leu Ala Asp Val Ala He Gly Ala 
530 535 540 

Pro Gly Glu Glu Glu Asn Arg Gly Ala Val Tyr He Phe His Gly Ala 
545 550 555 560 

Ser Arg Gin Asp He Ala Pro Ser Pro Ser Gin Arg Val Thr Gly Ser 
565 570 575 

Gin Leu Phe Leu Arg Leu Gin Tyr Phe Gly Gin Ser Leu Ser Gly Gly 
580 585 590 

Gin Asp Leu Thr Gin Asp Gly Leu Val Asp Leu Ala Val Gly Ala Gin 
595 600 605 

Gly His Val Leu Leu Leu Arg Ser Leu Pro Leu Leu Lys Val Gly He 
610 615 620 

Ser He Arg Phe Ala Pro Ser Glu Val Ala Lys Thr Val Tyr Gin Cys 
625 630 635 640 

Trp Gly Arg Thr Pro Thr Val Leu Glu Ala Gly Glu Ala Thr Val Cys 
645 650 655 

Leu Thr Val Arg Lys Gly Ser Pro Asp Leu Leu Gly Asp Val Gin Ser 
660 665 670 

Ser Val Arg Tyr Asp Leu Ala Leu Asp Pro Gly Arg Leu He Ser Arg 
675 680 685 

Ala He Phe Asp Glu Thr Lys Asn Cys Thr Leu Thr Arg Arg Lys Thr 
690 695 700 

Leu Gly Leu Gly Asp His Cys Glu Thr Met Lys Leu Leu Leu Pro Asp 
705 710 715 720 

Cys Val Glu Asp Ala Val Thr Pro He He Leu Arg Leu Asn Leu Ser 
725 730 735 

Leu Ala Gly Asp Ser Ala Pro Ser Arg Asn Leu Arg Pro Val Leu Ala 
740 745 750 
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Val Gly Ser Gin Asp Hie Val Thr Ala Ser Phe Pro Phe Glu Lys Asn 
755 760 765 

CyB Glu Gly Asn Leu Gly Val Ser Phe Asn Phe Ser Gly Leu Gin Val 
770 775 780 

Leu Glu Val Gly Ser Ser Pro Glu Leu Thr Val Thr Val Thr Val Trp 
785 790 795 800 

Asn Glu Gly Glu Asp Ser Tyr Gly Thr Leu lie Lys Phe Tyr Tyr Pro 
805 810 815 

Ala Glu Leu Ser Tyr Arg Arg Val Thr Arg Ala Gin Gin Pro His Pro 
820 825 830 

Tyr Pro Leu Arg Leu Ala Cys Glu Ala Glu Pro Thr Gly Gin Glu Ser 
835 840 845 

Leu Arg Ser Ser Ser Cys Ser lie Asn His Pro lie Phe Arg Glu Gly 
850 855 860 

Ala Lys Ala Thr Phe Met lie Thr Phe Asp Val Ser Tyr Lys Ala Phe 
865 870 875 880 

Leu Gly Asp Arg Leu Leu Leu Arg Ala Ser Ala Ser Ser Glu Asn Asn 
885 890 895 

Lys Pro Glu Thr Ser Lys Thr Ala Phe Gin Leu Glu Leu Pro Val Lys 
900 905 910 

Tyr Thr Val Tyr Thr Val He Ser Arg Gin Glu Asp Ser Thr Lys His 
915 920 925 

Phe Asn Phe Ser Ser Ser His Gly Glu Arg Gin Lys Glu Ala Glu His 
930 935 940 

Arg Tyr Arg Val Asn Asn Leu Ser Pro Leu Thr Leu Ala He Ser Val 
945 950 955 960 

Asn Phe Trp Val Pro He Leu Leu Asn Gly Val Ala Val Trp Asp Val 
965 970 975 

Thr Leu Arg Ser Pro Ala Gin Gly Val Ser Cys Val Ser Gin Arg Glu 
980 985 990 

Pro Pro Gin His Ser Asp Leu Leu Thr Gin He Gin Gly Arg Ser Val 
995 1000 1005 

Leu Asp Cys Ala He Ala Asp Cys Leu His Leu Arg Cys Asp He Pro 
1010 1015 1020 

Ser Leu Gly Thr Leu Asp Glu Leu Asp Phe He Leu Lys Gly Asn Leu 
1025 1030 1035 1040 

Ser Phe Gly Trp He Ser Gin Thr Leu Gin Lys Lys Val Leu Leu Leu 
1045 1050 1055 

Ser Glu Ala Glu He Thr Phe Asn Thr Ser Val Tyr Ser Gin Leu Pro 
1060 1065 1070 
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Gly Gin Glu Ala Phe Leu Arg Ala Gin Val Ser Thr Met Leu Glu Glu 
1075 1080 1085 

Tyr Val Val Tyr Glu Pro Val Phe Leu Met Val Phe Ser Ser Val Gly 
1090 1095 1100 

Glv Leu Leu Leu Leu Ala Leu He Thr Val Ala Leu Tyr Lys Leu Gly 
1105 1110 1115 1120 

Phe Phe Lys Arg Gin Tyr Lys Glu Met Leu Asp Leu Pro Ser Ala Asp 
1125 1130 1135 

Pro Asp Pro Ala Gly Gin Ala Asp Ser Asn His Glu Thr Pro Pro His 
1140 1145 1150 

Leu Thr Ser 
1155 

(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 base pairs 

(B) TYPE: nucleic acid 

< C ) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

< ii ) MOLECULE TYPE : DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 
AGTTACGGAT CCGGCACCAT GACCTTCGGC ACTGTGATCC TCCTGTGTG 49 
(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 
GCTGGACGAT GGCATCCAC 
(2) INFORMATION FOR SEQ ID NO: 49: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

< C ) STRANDEDNESS : single 
( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:49: 
GTAGAGTTAC GGATCCGGCA CCAT 
(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
GCAGCCAGCT TCGGACAGAC 
(2) INFORMATION FOR SEQ ID NOsSl: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 
CCATGTCCAC AGAACAGAGA G 
(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3803 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



( ix ) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: l.,3486 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52 : 

ATG GTC CGT GGA GTT GTG ATC CTC CTG TGT GGC TGG GCC CTG GCT TCC 48 
Met Val Arg Gly Val Val He Leu Leu Cys Gly Trp Ala Leu Ala Ser 
-L 5 10 



15 



TGT CAT GGG TCT AAC CTG GAT GTG GAG AAG CCC GTC GTG TTC AAA GAG 96 
Cys Hxs Gly Ser Asn Leu Asp Val Glu Lys Pro Val Val Phe Lys Glu 
20 25 30 
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GAT GCA GCC AGC TTC GGA CAG ACT GTG GTG GAG TTT GGT GGA TCT CGA 144 
Asp Ala Ala Ser Phe Gly Gin Thr Val Val Gin Phe Gly Gly Ser Arg 
35 40 45 

CTC GTG GTG GGA GCC CCT CTG GAG GCG GTG GCA GTC AAC CAA ACA GGA 192 
Leu Val Val Gly Ala Pro Leu Glu Ala Val Ala Val Asn Gin Thr Gly 
50 55 60 

CAG TCG TCT GAC TGT CCG CCT GCC ACT GGC GTG TGC CAG CCC ATC TTA 240 
Gin ser Ser Asp Cys Pro Pro Ala Thr Gly Val Cys Gin Pro lie Leu 
65 70 75 80 

CTG CAC ATT CCC CTA GAG GCA GTG AAC ATG TCC CTG GGC CTG TCT CTG 288 
Leu His lie Pro Leu Glu Ala Val Asn Met Ser Leu Gly Leu Ser Leu 
85 90 95 

GTG GCT GAC ACC AAT AAC TCC CAG TTG CTG GCT TGT GGT CCA ACT GCA 336 
Val Ala Asp Thr Asn Asn Ser Gin Leu Leu Ala Cys Gly Pro Thr Ala 
100 105 110 

CAG AGA GCT TGT GCA AAG AAC ATG TAT GCA AAA GGT TCC TGC CTC CTT 384 
Gin Arg Ala Cys Ala Lys Asn Met Tyr Ala Lys Gly Ser Cys Leu Leu 
115 120 125 

CTG GGC TCC AGC TTG CAG TTC ATC CAG GCA ATC CCT GCT ACC ATG CCA 432 
Leu Gly Ser Ser Leu Gin Phe lie Gin Ala lie Pro Ala Thr Met Pro 
130 135 140 

GAG TGT CCA GGA CAA GAG ATG GAC ATT GCT TTC CTG ATT GAT GGC TCC 480 
Glu Cys Pro Gly Gin Glu Met Asp lie Ala Phe Leu lie Asp Gly Ser 
145 150 155 160 

GGC AGC ATT GAT CAA AGT GAC TTT ACC CAG ATG AAG GAC TTC GTC AAA 528 
Gly Ser lie Asp Gin Ser Asp Phe Thr Gin Met Lys Asp Phe Val Lys 
165 170 175 

GCT TTG ATG GGC CAG TTG GCG AGC ACC AGC ACC TCG TTC TCC CTG ATG 576 
Ala Leu Met Gly Gin Leu Ala Ser Thr Ser Thr Ser Phe Ser Leu Met 
180 185 190 

CAA TAG TCA AAC ATC CTG AAG ACT CAT TTT ACC TTC ACG GAA TTC AAG 624 
Gin Tyr Ser Asn He Leu Lys Thr His Phe Thr Phe Thr Glu Phe Lys 
195 200 205 

AGC AGC CTG AGC CCT CAG AGC CTG GTG GAT GCC ATC GTC CAG CTC CAA 672 
Ser ser Leu Ser Pro Gin Ser Leu Val Asp Ala He Val Gin Leu Gin 
210 215 220 

GGC CTG ACG TAC ACA GCC TCG GGC ATC CAG AAA GTG GTG AAA GAG CTA 720 
Gly Leu Thr Tyr Thr Ala Ser Gly He Gin Lys Val Val Lys Glu Leu 
225 230 235 240 

TTT CAT AGC AAG AAT GGG GCC CGA AAA AGT GCC AAG AAG ATA CTA ATT 768 
Phe His Ser Lys Asn Gly Ala Arg Lys Ser Ala Lys Lys He Leu He 
245 250 255 

GTC ATC ACA GAT GGG CAG AAA TTC AGA GAC CCC CTG GAG TAT AGA CAT 816 
Val He Thr Asp Gly Gin Lys Phe Arg Asp Pro Leu Glu Tyr Arg His 
260 265 270 
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GTC ATC CCT GAA GCA GAG AAA GCT GGG ATC ATT CGC TAT GCT ATA GGG 864 
Val He Pro Glu Ala Glu Lys Ala Gly He He Arg Tyr Ala He Glv 
275 280 285 

GTG GGA GAT GCC TTC CGG GAA CCC ACT GCC CTA CAG GAG CTG AAC ACC 912 
Val Gly Asp Ala Phe Arg Glu Pro Thr Ala Leu Gin Glu Leu Asn Thr 
290 295 300 

ATT GGC TCA GCT CCC TCG CAG GAC CAC GTG TTC AAG GTG GGC AAT TTT 960 
He Gly Ser Ala Pro Ser Gin Asp His Val Phe Lys Val Gly Asn Phe 

310 315 320 

GTA GCA CTT CGC AGC ATC CAG CGG CAA ATT CAG GAG AAA ATC TTT GCC 1008 
Val Ala Leu Arg Ser He Gin Arg Gin He Gin Glu Lys He Phe Ala 
325 330 335 

ATT GAA GGA ACC GAA TCA AGG TCA AGT AGT TCC TTT CAG CAC GAG ATG 1056 
He Glu Gly Thr Glu Ser Arg Ser Ser Ser Ser Phe Gin His Glu Met 
340 345 350 

TCA CAA GAA GGT TTC AGC TCA GCT CTC TCA ATG GAT GGA CCA GTT CTG 
Ser Gin Glu Gly Phe Ser Ser Ala Leu Ser Met Asp Gly Pro Val Leu 
355 360 365 

GGG GCT GTG GGA GGC TTC AGC TGG TCT GGA GGT GCC TTC TTG TAG CCC 1152 

^'^^ Gly Gly Ala Phe Leu Tyr Pro 

370 375 

TCA AAT ATG AGA TCC ACC TTC ATC AAC ATG TCT CAG GAG AAC GAG GAT 1200 
Ser Asn Met Arg Ser Thr Phe He Asn Met Ser Gin Glu Asn Glu Asp 

390 395 400 

ATG AGG GAC GCT TAC CTG GGT TAC TCC ACC GCA CTG GCC TTT TGG AAG 1248 
Met Arg Asp Ala Tyr Leu Gly Tyr Ser Thr Ala Leu Ala Phe Trp Lys 
405 410 415 ^ 

GGG GTC CAC AGC CTG ATC CTG GGG GCC CCT CGC CAC CAG CAC ACG GGG 1296 
Gly Val Hxs Ser Leu He Leu Gly Ala Pro Arg His Gin His Thr Gly 
420 425 430 

AAG GTT GTC ATC TTT ACC CAG GAA TCC AGG CAC TGG AGG CCC AAG TCT 1344 
Lys Val Val He Phe Thr Gin Glu Ser Arg His Trp Arg Pro Lys Ser 
435 440 445 

GAA GTC AGA GGG ACA CAG ATC GGC TCC TAC TTT GGG GCA TCT CTC TGT 

^"^^ Yfi ser Tyr Phe Gly Ala Ser Leu Cys 

450 455 ^ 

TCT GTG GAC ATG GAT AGA GAT GGC AGC ACT GAC CTG GTC CTG ATT GGA 1440 
Ser Val Asp Met Asp Arg Asp Gly Ser Thr Asp Leu Val Leu He Gly 

470 475 480 

GTC CCC CAT TAC TAT GAG CAC ACC CGA GGG GGG CAG GTG TCG GTG TGC 1488 
Val Pro His Tyr Tyr Glu His Thr Arg Gly Gly Gin Val Ser Val Cys 
485 490 495 

CCC ATG CCT GGT GTG AGG AGC AGG TGG CAT TGT GGG ACC ACC CTC CAT 1536 
Pro Met Pro Gly Val Arg Ser Arg Trp His Cys Gly Thr Thr Leu His 
500 505 510 
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GGG GAG CAG GGC CAT CCT TGG GGC CGC TTT GGG GCG GOT CTG ACA GTG 1584 
Gly Glu Gin Gly His Pro Trp Gly Arg Phe Gly Ala Ala Leu Thr Val 
515 520 525 

CTA GGG GAG GTG AAT GGG GAC AGT CTG GCG GAT GTG GCT ATT GGT GCA 1632 
Leu Gly Asp Val Asn Gly Asp Ser Leu Ala Asp Val Ala lie Gly Ala 
530 535 540 

CCC GGA GAG GAG GAG AAC AGA GGT GCT GTC TAG ATA TTT CAT GGA GCC 1680 
Pro Gly Glu Glu Glu Asn Arg Gly Ala Val Tyr lie Phe His Gly Ala 
545 550 555 560 

TCG AGA CAG GAC ATC GCT CCC TCG CCT AGC CAG CGG GTC ACT GGC TCC 1728 
Ser Arg Gin Asp lie Ala Pro Ser Pro Ser Gin Arg Val Thr Gly Ser 
565 570 575 

CAG CTC TTC CTG AGG CTC CAA TAT TTT GGG CAG TCA TTA AGT GGG GGT 177 6 

Gin Leu Phe Leu Arg Leu Gin Tyr Phe Gly Gin Ser Leu Ser Gly Gly 
580 585 590 

CAG GAC CTT ACA CAG GAT GGC CTG GTG GAC CTG GCC GTG GGA GCC CAG 1824 
Gin Asp Leu Thr Gin Asp Gly Leu Val Asp Leu Ala Val Gly Ala Gin 
595 600 605 

GGG CAC GTG CTG CTG CTT AGG AGT CTG CCT TTG CTG AAA GTG GGG ATC 1872 
Gly His Val Leu Leu Leu Arg Ser Leu Pro Leu Leu Lys Val Gly lie 
610 615 620 

TCC ATT AGA TTT GCC CCC TCA GAG GTG GCA AAG ACT GTG TAC CAG TGC 1920 
Ser lie Arg Phe Ala Pro Ser Glu Val Ala Lys Thr Val Tyr Gin Cys 
625 630 635 640 

TGG GGA AGG ACT CCC ACT GTC CTC GAA GCT GGA GAG GCC ACC GTC TGT 1968 
Trp Gly Arg Thr Pro Thr Val Leu Glu Ala Gly Glu Ala Thr Val Cys 
645 650 655 

CTC ACT GTC CGC AAA GGT TCA CCT GAC CTG TTA GGT GAT GTC CAA AGC 2016 
Leu Thr Val Arg Lys Gly Ser Pro Asp Leu Leu Gly Asp Val Gin Ser 
660 665 670 

TCT GTC AGG TAT GAT CTG GCG TTG GAT CCG GGC CGT CTG ATT TCT CGT 2064 
Ser Val Arg Tyr Asp Leu Ala Leu Asp Pro Gly Arg Leu lie Ser Arg 
675 680 685 

GCC ATT TTT GAT GAG ACG AAG AAC TGC ACT TTG ACC CGA AGG AAG ACT 2112 
Ala lie Phe Asp Glu Thr Lys Asn Cys Thr Leu Thr Arg Arg Lys Thr 
690 695 700 

CTG GGG CTT GGT GAT CAC TGC GAA ACA ATG AAG CTG CTT TTG CCA GAC 2160 
Leu Gly Leu Gly Asp His Cys Glu Thr Met Lys Leu Leu Leu Pro Asp 
705 710 715 720 

TGT GTG GAG GAT GCA GTG ACC CCT ATC ATC CTG CGC CTT AAC TTA TCC 220S 
Cys Val Glu Asp Ala Val Thr Pro lie lie Leu Arg Leu Asn Leu Ser 
725 730 735 

CTG GCA GGG GAC TCT GCT CCA TCC AGG AAC CTT CGT CCT GTG CTG GCT 2256 
Leu Ala Gly Asp Ser Ala Pro Ser Arg Asn Leu Arg Pro Val Leu Ala 
740 745 750 
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GTG GGC TCA CAA GAC CAT GTA ACA GCT TCT TTC CCG TTT GAG AAG AAC 2304 

Val Gly Ser Gin Asp His Val Thr Ala Ser Phe Pro Phe Glu Lys Asn 

755 760 765 

TGT AAG CAG GAG CTC CTG TGT GAG GGG AAC CTG GGC GTC AGC TTC AAC 2352 
Cys Lys Gin Glu Leu Leu Cys Glu Gly Asn Leu Gly Val Ser Phe Asn 
770 775 780 

TTC TCA GGC CTG CAG GTC TTG GAG GTA GGA AGC TCC CCA GAG CTC ACT 2400 
Phe ser Gly Leu Gin Val Leu Glu Val Gly Ser Ser Pro Glu Leu Thr 
785 790 795 800 

GTG ACA GTA ACA GTT TGG AAT GAG GGT GAG GAC AGC TAT GGA ACC TTA 2448 
Val Thr Val Thr Val Trp Asn Glu Gly Glu Asp Ser Tyr Gly Thr Leu 
805 810 815 

ATC AAG TTC TAG TAG CCA GCA GAG CTA TCT TAC CGA CGG GTG ACA AGA 2496 
±±e Lys Phe Tyr Tyr Pro Ala Glu Leu Ser Tyr Arg Arg Val Thr Ara 
S20 825 830 

GCC CAG CAA CCT CAT CCG TAC CCA CTA CGC CTG GCA TGT GAG GCT GAG 2544 
Ala Gin Gin Pro His Pro Tyr Pro Leu Arg Leu Ala Cys Glu Ala Glu 
835 840 845 

CCC ACG GGC CAG GAG AGC CTG AGG AGC AGC AGC TGT AGC ATC AAT CAC 2592 
Pro Thr Gly Gin Glu Ser Leu Arg Ser Ser Ser Cys Ser lie Asn His 
S50 855 860 

CCC ATC TTC CGA GAA GGT GCC AAG GCC ACC TTC ATG ATC ACA TTT GAT 2640 
Pro He Phe Arg Glu Gly Ala Lys Ala Thr Phe Met He Thr Phe Asp 

©"70 875 880 

CTC TCC TAC AAG GCC TTC CTG GGA GAC AGG TTG CTT CTG AGG GCC AGC 2688 
-al Ser Tyr Lys Ala Phe Leu Gly Asp Arg Leu Leu Leu Arg Ala Ser 
885 890 895 

GCA AGC AGT GAG AAT AAT AAG CCT GAA ACC AGC AAG ACT GCC TTC CAG 2736 
..-a Ser Ser Glu Asn Asn Lys Pro Glu Thr Ser Lys Thr Ala Phe Gin 
900 905 910 

"C GAG CTT CCG GTG AAG TAC ACG GTC TAT ACC GTG ATC AGT AGG CAG 2784 
-eu Glu Leu Pro Val Lys Tyr Thr Val Tyr Thr Val He Ser Arg Gin 
915 920 925 

GAA GAT TCT ACC AAG CAT TTC AAC TTC TCA TCT TCC CAC GGG GAG AGA 2832 
Glu Asp Ser Thr Lys His Phe Asn Phe Ser Ser Ser His Gly Glu Arg 

935 ^ 

-:-G AAA GAG GCC GAA CAT CGA TAT CGT GTG AAT AAC CTG AGT CCA TTG 
O^n Lys Glu Ala Glu His Arg Tyr Arg Val Asn Asn Leu Ser Pro Leu 
"■^^ 950 955 960 

ACG CTG GCC ATC AGC GTT AAC TTC TGG GTC CCC ATC CTT CTG AAT GGT 2928 
-nr Leu Ala He Ser Val Asn Phe Trp Val Pro He Leu Leu Asn Glv 
965 970 975 

w^? CCA GCA CAG GGT GTC TCC 2976 

.1 Ala Val Trp Asp Val Thr Leu Arg Ser Pro Ala Gin Gly Val Ser 
980 985 990 
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TGT GTG TCA CAG AGG GAA CCT CCT CAA CAT TCC GAC CTT CTG ACC CAG 3024 
Cys Val Ser Gin Arg Glu Pro Pro Gin His Ser Asp Leu Leu Thr Gin 
995 1000 1005 

ATC CAA GGA CGC TCT GTG CTG GAC TGC GCC ATC GCC GAC TGC CTG CAC 3072 
lie Gin Gly Arg Ser Val Leu Asp Cys Ala lie Ala Asp Cys Leu His 
1010 1015 1020 

CTC CGC TGT GAC ATC CCC TCC TTG GGC ACC CTG GAT GAG CTT GAC TTC 3120 
Leu Arg Cys Asp lie Pro Ser Leu Gly Thr Leu Asp Glu Leu Asp Phe 
1025 1030 1035 1040 

ATT CTG AAG GCC AAC CTC AGC TTC GGC TGG ATC AGT CAG ACA TTG CAG 3168 
lie Leu Lys Gly Asn Leu Ser Phe Gly Trp lie Ser Gin Thr Leu Gin 
1045 1050 1055 

AAA AAG GTC TTC CTC CTG AGT GAG GCT GAA ATC ACA TTC AAC ACA TCT 3216 
Lys Lys Val Leu Leu Leu Ser Glu Ala Glu lie Thr Phe Asn Thr Ser 
1060 1065 1070 

GTG TAT TCC CAC CTC CCC GGA CAG GAG GCA TTT CTG AGA GCC CAG GTG 3264 
Val Tyr Ser Gin Leu Pro Gly Gin Glu Ala Phe Leu Arg Ala Gin Val 
1075 1080 1085 

TCA ACG ATC CTA CAA GAA TAC GTG GTC TAT GAG CCC GTC TTC CTC ATG 3312 
Ser Thr Met Leu Glu Glu Tyr Val Val Tyr Glu Pro Val Phe Leu Met 
1090 1095 1100 

GTG TTC AGC TCA CTG GGA GGT CTG CTG TTA CTG GCT CTC ATC ACT GTG 3360 
Val Phe Ser Ser Val Gly Gly Leu Leu Leu Leu Ala Leu lie Thr Val 
1105 1110 1115 1120 

GCG CTG TAC AAG CTT GGC TTC TTC AAA CGT CAG TAT AAA GAG ATG CTG 3408 
Ala Leu Tyr Lys Leu Gly Phe Phe Lys Arg Gin Tyr Lys Glu Met Leu 
1125 1130 1135 

GAT CTA CCA TCT GCA CAT CCT GAC CCA GCC GGC CAG GCA GAT TCC AAC 3456 
Asp Leu Pro Ser Ala Asp Pro Asp Pro Ala Gly Gin Ala Asp Ser Asn 
1140 1145 1150 

CAT GAG ACT CCT CCA CAT CTC ACG TCC TAGGAATCTA CTTTCCTGTA 3503 
His Glu Thr Pro Pro His Leu Thr Ser 
1155 1160 

TATCTCCACA ATTACGAGAT TGGTTTTGCT TTTGCCTATG AATCTACTGG CATGGGAACA 3563 

AGTTCTCTTC AGCTCTGGGC TAGCCTGGGA AACTTCCCAG AAATGATGCC CTACCTCCTG 3623 

AGCTGGGAGA TTTTTATGGT TTGCCCATGT GTCAGATTTC AGTGCTGATC CACTTTTTTT 3683 

GCAAGAGCAG GAATGGGGTC AGCATAAATT TACATATGGA TAAGAACTAA CACAAGACTG 3743 

AGTAATATGC TCAATATTCA ATGTATTGCT TGTATAAATT TTTAAAAAAT AAAATGAAAN 3803 



(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1161 amino acids 
<B) TYPE: amino acid 
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(D) TOPOLOGY s linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: S3: 

Met Val Arg Gly Val Val lie Leu Leu Cys Gly Trp Ala Leu Ala Ser 
15 10 15 

Cys His Gly Ser Asn Leu Asp Val Glu Lye Pro Val Val Phe Lvs Glu 
20 25 30 

Asp Ala Ala Ser Phe Gly Gin Thr Val Val Gin Phe Gly Gly Ser Ara 
35 40 45 

Leu Val Val Gly Ala Pro Leu Glu Ala Val Ala Val Asn Gin Thr Glv 
50 55 60 

Gin Ser Ser Asp Cys Pro Pro Ala Thr Gly Val Cys Gin Pro lie Leu 

70 75 80 

Leu His lie Pro Leu Glu Ala Val Asn Met Ser Leu Gly Leu Ser Leu 
85 90 95 

Val Ala Asp Thr Asn Asn Ser Gin Leu Leu Ala Cys Gly Pro Thr Ala 
100 105 110 

Gin Arg Ala Cys Ala Lys Asn Met Tyr Ala Lys Gly Ser Cys Leu Leu 
115 120 125 

Leu Gly Ser Ser Leu Gin Phe He Gin Ala He Pro Ala Thr Met Pro 
130 135 240 

Glu Cys Pro Gly Gin Glu Met Asp He Ala Phe Leu He Asp Gly Ser 

150 155 160 

Gly Ser He Asp Gin Ser Asp Phe Thr Gin Met Lys Asp Phe Val Lys 
165 170 175 

Ala Leu Met Gly Gin Leu Ala Ser Thr Ser Thr Ser Phe Ser Leu Met 
ISO 185 190 

Gin Tyr Ser Asn .He Leu Lys Thr His Phe Thr Phe Thr Glu Phe Lve 
195 200 205 

f?^ "^^1 Asp Ala He Val Gin Leu Gin 

210 215 220 

Gly Leu Thr Tyr Thr Ala Ser Gly He Gin Lys Val Val Lys Glu Leu 

230 235 240 

Phe His Ser Lys Asn Gly Ala Arg Lys Ser Ala Lys Lys He Leu He 
245 250 255 

Val He Thr Asp Gly Gin Lys Phe Arg Asp Pro Leu Glu Tyr Arg His 
260 265 270 

Val He Pro Glu Ala Glu Lys Ala Gly He He Arg Tyr Ala He Gly 
275 280 285 
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Val Gly Asp Ala Phe Arg Glu Pro Thr Ala Leu Gin Glu Leu Asn Thr 
290 295 300 

lie Gly Ser Ala Pro Ser Gin Asp His Val Phe Lys Val Gly Asn Phe 
305 310 315 320 

Val Ala Leu Arg Ser lie Gin Arg Gin lie Gin Glu Lys lie Phe Ala 
325 330 335 

lie Glu Gly Thr Glu Ser Arg Ser Ser Ser Ser Phe Gin His Glu Met 
340 345 350 

Ser Gin Glu Gly Phe Ser Ser Ala Leu Ser Met Asp Gly Pro Val Leu 
355 360 365 

Gly Ala Val Gly Gly Phe Ser Trp Ser Gly Gly Ala Phe Leu Tyr Pro 
370 375 380 

Ser Asn Met Arg Ser Thr Phe lie Asn Met Ser Gin Glu Asn Glu Asp 
385 390 395 400 

Met Arq Asp Ala Tyr Leu Gly Tyr Ser Thr Ala Leu Ala Phe Trp Lys 
405 410 415 

Gly Val His Ser Leu lie Leu Gly Ala Pro Arg His Gin His Thr Gly 
420 425 430 

Lys Val Val lie Phe Thr Gin Glu Ser Arg His Trp Arg Pro Lys Ser 
435 440 445 

Glu Val Arg Gly Thr Gin lie Gly Ser Tyr Phe Gly Ala Ser Leu Cys 
450 455 460 

Ser Val Asp Met Asp Arg Asp Gly Ser Thr Asp Leu Val Leu lie Gly 
465 470 475 480 

Val Pro His Tyr Tyr Glu His Thr Arg Gly Gly Gin Val Ser Val Cys 
485 490 495 

Pro Met Pro Gly Val Arg Ser Arg Trp His Cys Gly Thr Thr Leu His 
500 505 510 

Gly Glu Gin Gly His Pro Trp Gly Arg Phe Gly Ala Ala Leu Thr Val 
515 520 525 

Leu Gly Asp Val Asn Gly Asp Ser Leu Ala Asp Val Ala lie Gly Ala 
530 535 540 

Pro Gly Glu Glu Glu Asn Arg Gly Ala Val Tyr lie Phe His Gly Ala 
545 550 555 560 

Ser Arg Gin Asp lie Ala Pro Ser Pro Ser Gin Arg Val Thr Gly Ser 
565 570 575 

Gin Leu Phe Leu Arg Leu Gin Tyr Phe Gly Gin Ser Leu Ser Gly Gly 
580 585 590 

Gin Asp Leu Thr Gin Asp Gly Leu Val Asp Leu Ala Val Gly Ala Gin 
595 600 605 
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Gly His Val Leu Leu Leu Arg Ser Leu Pro Leu Leu Lys Val Glv lie 

615 620 

Ser lie Arg Phe Ala Pro Ser Glu Val Ala Lys Thr Val Tyr Gin Cys 

630 635 640 

Trp Gly Arg Thr Pro Thr Val Leu Glu Ala Gly Glu Ala Thr Val Cve 
645 650 655 

Leu Thr Val Arg Lys Gly Ser Pro Asp Leu Leu Gly Aap Val Gin Ser 
660 665 670 

Ser Val Arg Tyr Asp Leu Ala Leu Asp Pro Gly Arg Leu He Ser Arg 
675 680 685 

lit '^^^ ^i^r Arg Arg Lvs Thr 

690 695 700 ^ 

705 "^^ ^^"^ ^^"^ Asp 

710 715 

Cys Val Glu Asp Ala Val Thr Pro He He Leu Arg Leu Asn Leu Ser 

. 725 730 735 

Leu Ala Gly Asp Ser Ala Pro Ser Arg Asn Leu Arg Pro Val Leu Ala 

745 750 

Val Gly Ser Gin Asp His Val Thr Ala Ser Phe Pro Phe Glu Lys Asn 
755 760 765 

Cys Lye Gin Glu Leu Leu Cys Glu Gly Asn Leu Gly Val Ser Phe Asn 
''^ 775 780 

Phe Ser Gly Leu Gin Val Leu Glu Val Gly Ser Ser Pro Glu Leu Thr 

795 800 

Val Thr Val Thr Val Trp Asn Glu Gly Glu Asp Ser Tyr Gly Thr Leu 
805 810 815 

He Lys Phe Tyr Tyr Pro Ala Glu Leu Ser Tyr Arg Arg Val Thr Arg 
®20 825 830 

Ala Gin Gin Pro His Pro Tyr Pro Leu Arg Leu Ala Cys Glu Ala Glu 

840 845 

Pro Thr Gly Gin Glu Ser Leu Arg Ser Ser Ser Cys Ser He Asn His 

855 860 

Pro He Phe Arg Glu Gly Ala Lys Ala Thr Phe Met He Thr Phe Asp 

val Ser Tyr Lys Ala Phe Leu Gly Asp Arg Leu Leu Leu Arg Ala Ser 
885 890 895 

Ala ser Ser Glu Asn Asn Lys Pro Glu Thr Ser Lys Thr Ala Phe Gin 
900 905 910 

Leu Glu Leu Pro Val Lys Tyr Thr Val Tyr Thr Val He Ser Arg Gin 

920 925 
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Glu Asp Ser Thr Lys His Phe Asn Phe Ser Ser Ser His Gly Glu Arg 
930 935 940 

Gin Lys Glu Ala Glu His Arg Tyr Arg Val Asn Asn Leu Ser Pro Leu 
945 950 955 960 

Thr Leu Ala lie Ser Val Asn Phe Trp Val Pro lie Leu Leu Asn Gly 
965 970 975 

Val Ala Val Trp Asp Val Thr Leu Arg Ser Pro Ala Gin Gly Val Ser 
980 985 990 

Cys Val Ser Gin Arg Glu Pro Pro Gin His Ser Asp Leu Leu Thr Gin 
995 1000 1005 

lie Gin Gly Arg Ser Val Leu Asp Cys Ala lie Ala Asp Cys Leu His 
1010 1015 1020 

Leu Arg Cys Asp lie Pro Ser Leu Gly Thr Leu Asp Glu Leu Asp Phe 
1025 1030 ' 1035 1040 

lie Leu Lys Gly Asn Leu Ser Phe Gly Trp lie Ser Gin Thr Leu Gin 
1045 1050 1055 

Lys Lys Val Leu Leu Leu Ser Glu Ala Glu lie Thr Phe Asn Thr Ser 
1060 1065 1070 

Val Tyr Ser Gin Leu Pro Gly Gin Glu Ala Phe Leu Arg Ala Gin Val 
1075 1080 1085 

Ser Thr Met Leu Glu Glu Tyr Val Val Tyr Glu Pro Val Phe Leu Met 
1090 1095 1100 

Val Phe Ser Ser Val Gly Gly Leu Leu Leu Leu Ala Leu lie Thr Val 
1105 1110 1115 1120 

Ala Leu Tyr Lys Leu Gly Phe Phe Lys Arg Gin Tyr Lys Glu Met Leu 
1125 1130 1135 

Asp Leu Pro Ser Ala Asp Pro Asp Pro Ala Gly Gin Ala Asp Ser Asn 
1140 1145 1150 

His Glu Thr Pro Pro His Leu Thr Ser 
1155 1160 

(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3597 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : single 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 40.. 352 5 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:54r 

AGCTTTACAG CTCTCTACTT CTCAGTGCAC TGCTCAGTG ATG GCC GGT GGA GTT 54 

Met Ala Gly Gly Val 
1 5 

GTG ATC CTC CTG TGT GGC TGG GTC CTG GCT TCC TGT CAT GGG TCT AAC 102 
Val He Leu Leu Cys Gly Trp Val Leu Ala Ser Cys His Gly Ser Asn 
10 15 20 

CTG GAT GTG GAG GAA CCC ATC GTG TTC AGA GAG GAT GCA GCC AGC TTT ISO 
Leu Asp Val Glu Glu Pro He Val Phe Arg Glu Asp Ala Ala Ser Phe 
25 30 35 

GGA CAG ACT GTG GTG CAG TTT GGT GGA TCT CGA CTC GTG GTG GGA GCC 198 
Gly Gin Thr Val Val Gin Phe Gly Gly Ser Arg Leu Val Val Gly Ala 
40 45 50 

CCT CTG GAG GCG GTG GCA GTC AAC CAA ACA GGA CGG TTG TAT GAC TGT 246 
Pro Leu Glu Ala Val Ala Val Asn Gin Thr Gly Arg Leu Tyr Asp Cys 
55 60 65 

GCA CCT GCC ACT GGC ATG TGC CAG CCC ATC GTA CTG CGC AGT CCC CTA 294 
Ala Pro Ala Thr Gly Met Cys Gin Pro He Val Leu Arg Ser Pro Leu 
70 75 80 85 

GAG GCA GTG AAC ATG TCC CTG GGC CTG TCT CTG GTG ACT GCC ACC AAT 342 
Glu Ala Val Asn Met Ser Leu Gly Leu Ser Leu Val Thr Ala Thr Asn 
90 95 100 

AAC GCC CAG TTG CTG GCT TGT GGT CCA ACT GCA CAG AGA GCT TGT GTG 
Asn Ala Gin Leu Leu Ala Cys Gly Pro Thr Ala Gin Arg Ala Cys Val 
105 110 115 

AAG AAC ATG TAT GCG AAA GGT TCC TGC CTC CTT CTC GGC TCC AGC TTG 
Lys Asn Met Tyr Ala Lys Gly Ser Cys Leu Leu Leu Gly Ser Ser Leu 
120 125 130 

CAG TTC ATC CAG GCA GTC CCT GCC TCC ATG CCA GAG TGT CCA AGA CAA 486 
Gin Phe He Gin Ala Val Pro Ala Ser Met Pro Glu Cys Pro Arg Gin 
135 140 145 

GAG ATG GAC ATT GCT TTC CTG ATT GAT GGT TCT GGC AGC ATT AAC CAA 
Glu Met Asp He Ala Phe Leu He Asp Gly Ser Gly Ser He Asn Gin 
150 155 160 165 

AGG GAC TTT GCC CAG ATG AAG GAC TTT GTC AAA GCT TTG ATG GGA GAG 582 
Arg Asp Phe Ala Gin Met Lys Asp Phe Val Lys Ala Leu Met Gly Glu 
170 175 180 

TTT GCG AGC ACC AGC ACC TTG TTC TCC CTG ATG CAA TAC TCG AAC ATC 
Phe Ala Ser Thr Ser Thr Leu Phe Ser Leu Met Gin Tyr ser Asn He 
185 190 195 

CTG AAG ACC CAT TTT ACC TTC ACT GAA TTC AAG AAC ATC CTG GAC CCT 678 
Leu Lys Thr His Phe Thr Phe Thr Glu Phe Lys Asn He Leu Asp Pro 
200 205 210 

CAG AGC CTG GTG GAT CCC ATT GTC CAG CTG CAA GGC CTG ACC TAC ACA 726 
Gin Ser Leu Val Asp Pro He Val Gin Leu Gin Gly Leu Thr Tyr Thr 
215 220 225 



390 



438 



534 



630 
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GCC ACA GGC ATC CGG ACA GTG ATG GAA GAG CTA TTT CAT AGC AAG AAT 774 

Ala Thr Gly lie Arg Thr Val Met Glu Glu Leu Phe His Ser Lys Asn 
230 235 240 245 

GGG TCC CGT AAA AGT GCC AAG AAG ATC CTC CTT GTC ATC ACA GAT GGG 822 
Gly Ser Arg Lys Ser Ala Lys Lys lie Leu Leu Val He Thr Asp Gly 
250 255 260 

CAG AAA TAC AGA GAC CCC CTG GAG TAT AGT GAT GTC ATT CCC GCC GCA 870 
Gin Lys Tyr Arg Asp Pro Leu Glu Tyr Ser Asp Val lie Pro Ala Ala 
265 270 275 

GAC AAA GCT GGC ATC ATT CGT TAT GCT ATT GGG GTG GGA GAT GCC TTC 918 
Asp Lys Ala Gly He lie Arg Tyr Ala He Gly Val Gly Asp Ala Phe 
280 285 290 

CAG GAG CCC ACT GCC CTG AAG GAG CTG AAC ACC ATT GGC TCA GCT CCC 966 
Gin Glu Pro Thr Ala Leu Lys Glu Leu Asn Thr He Gly Ser Ala Pro 
295 300 305 

CCA CAG GAC CAC GTG TTC AAG GTA GGC AAC TTT GCA GCA CTT CGC AGC 1014 
Pro Gin Asp His Val Phe Lys Val Gly Asn Phe Ala Ala Leu Arg Ser 
310 315 320 325 

ATC CAG AGG CAA CTT CAG GAG AAA ATC TTC GCC ATT GAG GGA ACT CAA 1062 
He Gin Arg Gin Leu Gin Glu Lys He Phe Ala He Glu Gly Thr Gin 
330 335 340 

TCA AGG TCA AGT AGT TCC TTT CAG CAC GAG ATG TCA CAA GAA GGT TTC 1110 
Ser Arg Ser Ser Ser Ser Phe Gin His Glu Met Ser Gin Glu Gly Phe 
345 350 355 

AGT TCA GCT CTC ACA TCG GAT GGA CCC GTT CTG GGG GCC GTG GGA AGC 1158 
Ser Ser Ala Leu Thr Ser Asp Gly Pro Val Leu Gly Ala Val Gly Ser 
360 365 370 

TTC AGC TGG TCC GGA GGT GCC TTC TTA TAT CCC CCA AAT ACG AGA CCC 1206 
Phe Ser Trp Ser Gly Gly Ala Phe Leu Tyr Pro Pro Asn Thr Arg Pro 
375 380 385 

ACC TTT ATC AAC ATG TCT CAG GAG AAT GTG GAC ATG AGA GAC TCC TAC 1254 
Thr Phe He Asn Met Ser Gin Glu Asn Val Asp Met Arg Asp Ser Tyr 
390 395 400 405 

CTG GGT TAC TCC ACC GCA GTG GCC TTT TGG AAG GGG GTT CAC AGC CTG 1302 
Leu Gly Tyr Ser Thr Ala Val Ala Phe Trp Lys Gly Val His Ser Leu 
410 415 420 

ATC CTG GGG GCC CCG CGT CAC CAG CAC ACG GGG AAG GTT GTC ATC TTT 1350 
He Leu Gly Ala Pro Arg His Gin His Thr Gly Lys Val Val He Phe 
425 430 435 

ACC CAG GAA GCC AGG CAT TGG AGG CCC AAG TCT GAA GTC AGA GGG ACA 1398 
Thr Gin Glu Ala Arg His Trp Arg Pro Lys Ser Glu Val Arg Gly Thr 
440 445 450 

CAG ATC GGC TCC TAC TTC GGG GCC TCT CTC TGT TCT GTG GAC GTG GAT 1446 
Gin He Gly Ser Tyr Phe Gly Ala Ser Leu Cys Ser Val Asp Val Asp 
455 460 465 
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AGA GAT GGC AGC ACY GAG CTG GTC CTG ATC GGA GCC CCC CAT TAG TAT 
Arg Asp Gly Ser Xaa Asp Leu Val Leu lie Gly Ala Pro His Tyr Tyr 
470 475 480 485 



1494 



GAG GAG ACC CGA GGG GGG CAG GTC TCA GTG TTC CCC GTG CCC GGT GTG 
Glu Gin Thr Arg Gly Gly Gin Val Ser Val Phe Pro Val Pro Gly Val 
490 495 500 

AGG GGC AGG TGG CAG TGT GAG GCC ACC CTC CAC GGG GAG CAG GGC CAT 
Arg Gly Arg Trp Gin Cys Glu Ala Thr Leu His Gly Glu Gin Gly His 
505 510 515 



1542 



1590 



CCT TGG GGC GGC TTT GGG GTG GCT CTG ACA GTG CTG GGG GAC GTA AAC 
Pro Trp Gly Arg Phe Gly Val Ala Leu Thr Val Leu Gly Asp Val Asn 
520 525 530 



1638 



GGG GAC AAT CTG GCA GAC GTG GCT ATT GGT GCC CCT GGA GAG GAG GAG 
Gly Asp Asn Leu Ala Asp Val Ala lie Gly Ala Pro Gly Glu Glu Glu 
535 540 545 



1686 



AGC AGA GGT GCT GTC TAG ATA TTT CAT GGA GCC TCG AGA CTG GAG ATC 1734 
Ser Arg Gly Ala Val Tyr lie Phe His Gly Ala Ser Arg Leu Glu lie 
550 555 560 565 

ATG CCC TCA CCC AGC CAG CGG GTC ACT GGC TCC CAG CTC TCC CTG AGA 1782 
Met Pro Ser Pro Ser Gin Arg Val Thr Gly Ser Gin Leu Ser Leu Arg 
570 575 580 



CTG CAG TAT TTT GGG CAG TCA TTG AGT GGG GGT CAG GAC CTT ACA CAG 
Leu Gin Tyr Phe Gly Gin Ser Leu Ser Gly Gly Gin Asp Leu Thr Gin 
585 590 595 

GAT GGC CTG GTG GAC CTG GCC GTG GGA GCC CAG GGG CAC GTA CTG CTG 
Asp Gly Leu Val Asp Leu Ala Val Gly Ala Gin Gly His Val Leu Leu 
600 605 610 



1830 



1878 



CTC AGG AGT CTG CCT CTG CTG AAA GTG GAG CTC TCC ATA AGA TTC GCC 
Leu Arg Ser Leu Pro Leu Leu Lys Val Glu Leu Ser lie Arg Phe Ala 
615 620 625 



1926 



CCC ATG GAG GTG GCA AAG GCT GTG TAG CAG TGC TGG GAA AGG ACT CCC 1974 

Pro Met Glu Val Ala Lys Ala Val Tyr Gin Cys Trp Glu Arg Thr Pro 

630 635 640 645 

ACT GTC CTC GAA GCT GGA GAG GCC ACT GTC TGT CTC ACT GTC CAC AAA 2022 

Thr Val Leu Glu Ala Gly Glu Ala Thr Val Cys Leu Thr Val His Lys 
650 655 660 



GGC TCA CCT GAC CTG TTA GGT AAT GTC CAA GGC TCT GTC AGG TAT GAT 
Gly Ser Pro Asp Leu Leu Gly Asn Val Gin Gly Ser Val Arg Tyr Asp 
665 670 675 

CTG GCG TTA GAT CCG GGC CGC CTG ATT TCT CGT GCC ATT TTT GAT GAG 
Leu Ala Leu Asp Pro Gly Arg Leu lie Ser Arg Ala lie Phe Asp Glu 
680 685 690 

ACT AAG AAC TGC ACT TTG ACG GGA AGG AAG ACT CTG GGG CTT GGT GAT 
Thr Lys Asn Cys Thr Leu Thr Gly Arg Lys Thr Leu Gly Leu Gly Asp 
695 700 705 



2070 



2118 



2166 
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CAC TGC GAA ACA GTG AAG CTG CTT TTG CCG GAG TGT 6TG GAA GAT GCA 2214 
His Cys Glu Thr Val Lys Leu Leu Leu Pro Asp Cys Val Glu Asp Ala 
710 715 /20 725 

GTG AGO CCT ATC ATC CTG CGC CTC AAC TTT TCC CTG GTG AGA GAC TOT 2262 
Val Ser Pro lie lie Leu Arg Leu Asn Phe Ser Leu Val Arg Asp Ser 
730 735 740 

GCT TCA CCC AGG AAC CTG CAT CCT GTG CTG GCT GTG GGC TCA CAA GAC 2310 
Ala Ser Pro Arg Asn Leu His Pro Val Leu Ala Val Gly Ser Gin Asp 
745 750 755 

CAC ATA ACT GCT TCT CTG CCG TTT GAG AAG AAC TGT AAG CAA GAA CTC 2358 
His lie Thr Ala Ser Leu Pro Phe Glu Lys Asn Cys Lys Gin Glu Leu 
760 765 770 

CTG TGT GAG GGG GAC CTG GGC ATC AGC TTT AAC TTC TCA GGC CTG CAG 2406 
Leu Cys Glu Gly Asp Leu Gly lie Ser Phe Asn Phe Ser Gly Leu Gin 
775 780 785 

GTC TTG GTG GTG GGA GGC TCC CCA GAG CTC ACT GTG ACA GTC ACT GTG 2454 
Val Leu Val Val Gly Gly Ser Pro Glu Leu Thr Val Thr Val Thr Val 
790 795 800 805 

TGG AAT GAG GGT GAG GAC AGC TAT GGA ACT TTA GTC AAG TTC TAC TAC 2502' 
Trp Asn Glu Gly Glu Asp Ser Tyr Gly Thr Leu Val Lys Phe Tyr Tyr 
810 815 820 

CCA GCA GGG CTA TCT TAC CGA CGG GTA ACA GGG ACT CAG CAA CCT CAT 2550 
Pro Ala Gly Leu Ser Tyr Arg Arg Val Thr Gly Thr Gin Gin Pro His 
825 830 835 

CAG TAC CCA CTA CGC TTG GCC TGT GAG GCT GAG CCC GCT GCC CAG GAG 2598 
Gin Tyr Pro Leu Arg Leu Ala Cys Glu Ala Glu Pro Ala Ala Gin Glu 
840 845 850 

GAC CTG AGG AGC AGC AGC TGT AGC ATT AAT CAC CCC ATC TTC CGA GAA 2 646 

Asp Leu Arg Ser Ser Ser Cys Ser lie Asn His Pro lie Phe Arg Glu 
855 860 865 

GGT GCA AAG ACC ACC TTC ATG ATC ACA TTC GAT GTC TCC TAC AAG GCC 2694 
Gly Ala Lys Thr Thr Phe Met lie Thr Phe Asp Val Ser Tyr Lys Ala 
870 875 880 885 

TTC CTA GGA GAC AGG TTG CTT CTG AGG GCC MiA GCC AGC AGT GAG AAT 2742 
Phe Leu Gly Asp Arg Leu Leu Leu Arg Ala Lys Ala Ser Ser Glu Asn 
890 895 900 

AAT AAG CCT GAT ACC AAC AAG ACT GCC TTC CAG CTG GAG CTC CCA GTG 2790 
Asn Lys Pro Asp Thr Asn Lys Thr Ala Phe Gin Leu Glu Leu Pro Val 
905 910 915 

AAG TAC ACC GTC TAT ACC CTG ATC AGT AGG CAA GAA GAT TCC ACC AAC 2838 
Lys Tyr Thr Val Tyr Thr Leu lie Ser Arg Gin Glu Asp Ser Thr Asn 
920 925 930 

CAT GTC AAC TTT TCA TCT TCC CAC GGG GGG AGA AGG CAA GAA GCC GCA 2886 
His Val Asn Phe Ser Ser Ser His Gly Gly Arg Arg Gin Glu Ala Ala 
935 940 945 
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CAT CGC TAT CGT GTG AAT AAC CTG AGT CCA CTG AAG CTG GCC GTC AGA 2934 
His Arg Tyr Arg Val Asn Asn Leu Ser Pro Leu Lys Leu Ala Val Arg 
950 955 960 965 

GTT AAC TTC TGG GTC CCT GTC CTT CTG AAC GGT GTG GCT GTG TGG GAC 2982 
Val Asn Phe Trp Val Pro Val Leu Leu Aen Gly Val Ala Val Trp Asp 
970 975 980 

GTG ACT CTG AGC AGC CCA GCA CAG GGT GTC TCC TGC GTG TCC CAG ATG 3030 
Val Thr Leu Ser Ser Pro Ala Gin Gly Val Ser Cys Val ser Gin Met 
985 990 995 



AAA CCT CCT CAG AAT CCC GAC TTT CTG ACC CAG ATT CAG AGA CGT TCT 3078 
Lys Pro Pro Gin Asn Pro Asp Phe Leu Thr Gin lie Gin Arg Arg Ser 
1000 1005 1010 

GTG CTG GAC TGC TCC ATT GCT GAC TGC CTG CAC TTC CGC TGT GAC ATC 3126 
Val Leu Asp Cys Ser He Ala Asp Cys Leu His Phe Arg Cys Asp He 
1015 1020 1025 

CCC TCC TTG GAC ATC CAG GAT GAA CTT GAC TTC ATT CTG AGG GGC AAC 3174 

Asp Phe He Leu Arg Gly Asn 
1030 1035 1040 1045 

CTC AGC TTC GGC TGG GTC AGT CAG ACA TTG CAG GAA AAG GTG TTG CTT 3222 
Leu Ser Phe Gly Trp Val Ser Gin Thr Leu Gin Glu Lys Val Leu Leu 
1050 1055 1060 

GTG AGT GAG GCT GAA ATC ACT TTC GAC ACA TCT GTG TAC TCC CAG CTG 3270 
Val Ser Glu Ala Glu He Thr Phe Asp Thr Ser Val Tyr Ser Gin Leu 
1065 1070 1075 

CCA GGA CAG GAG GCA TTT CTG AGA GCC CAG GTG GAG ACA ACG TTA GAA 3318 
Pro Gly Gin Glu Ala Phe Leu Arg Ala Gin Val Glu Thr Thr Leu Glu 
1080 1085 1090 

GAA TAC GTG GTC TAT GAG CCC ATC TTC CTC GTG GCG GGC AGC TCG GTG 3366 
Glu Tyr Val Val Tyr Glu Pro He Phe Leu Val Ala Gly Ser Ser Val 
1095 1100 1105 

GGA GGT CTG CTG TTA CTG GCT CTC ATC ACA GTG GTA CTG TAC AAG CTT 3414 
^i^y^^^^ Val Val Leu Tyr Lys Leu 

1110 1115 1120 1125 

GGC TTC TYC AAA CGT CAG TAC AAA GAA ATG CTG GAC GGC AAG GCT GCA 3462 
Gly Phe Xaa Lys Arg Gin Tyr Lys Glu Met Leu Asp Gly Lys Ala Ala 
1130 1135 1140 

GAT CCT GTC ACA GCC GGC CAG GCA GAT TTC GGC TGT GAG ACT CCT CCA 3510 
Asp Pro Val Thr Ala Gly Gin Ala Asp Phe Gly Cys Glu Thr Pro Pro 
1145 1150 1155 

TAT CTC GTG AGC TAGGAATCCA CTCTCCTGCC TATCTCTGCA ATGAAGATTG 3 
Tyr Leu Val Ser J^o^ 
1160 



GTCCTGCCTA TGAGTCTACT GGCATGGGAA CGAGT 



3597 
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<2) INFORMATION FOR SEQ ID NO: 55: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1161 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

Met Ala Gly Gly Val Val lie Leu Leu Cys Gly Trp Val Leu Ala Ser 
15 10 15 

Cys His Gly Ser Asn Leu Asp Val Glu Glu Pro lie Val Phe Arg Glu 
20 25 30 

Asp Ala Ala Ser Phe Gly Gin Thr Val Val Gin Phe Gly Gly Ser Arg 
35 40 45 

Leu Val Val Gly Ala Pro Leu Glu Ala val Ala Val Asn Gin Thr Gly 
50 55 60 

Arg Leu Tyr Asp Cys Ala Pro Ala Thr Gly Met Cys Gin Pro lie Val 
65 70 75 80 

Leu Arg Ser Pro Leu Glu Ala Val Asn Met Ser Leu Gly Leu Ser Leu 
85 90 95 

Val Thr Ala Thr Asn Asn Ala Gin Leu Leu Ala Cys Gly Pro Thr Ala 
100 105 110 

Gin Arg Ala Cys Val Lys Asn Met Tyr Ala Lys Gly Ser Cys Leu Leu 
115 120 125 

Leu Gly Ser Ser Leu Gin Phe lie Gin Ala Val Pro Ala Ser Met Pro 
130 135 140 

Glu Cys Pro Arg Gin Glu Met Asp lie Ala Phe Leu lie Asp Gly Ser 
145 150 155 160 

Gly Ser lie Asn Gin Arg Asp Phe Ala Gin Met Lys Asp Phe Val Lys 
165 170 175 

Ala Leu Met Gly Glu Phe Ala Ser Thr Ser Thr Leu Phe Ser Leu Met 
180 185 190 

Gin Tyr Ser Asn lie Leu Lys Thr His Phe Thr Phe Thr Glu Phe Lys 
195 200 205 

Asn lie Leu Asp Pro Gin Ser Leu Val Asp Pro lie Val Gin Leu Gin 
210 215 220 

Gly Leu Thr Tyr Thr Ala Thr Gly lie Arg Thr Val Met Glu Glu Leu 
225 230 235 240 

Phe His Ser Lys Asn Gly Ser Arg Lys Ser Ala Lys Lys lie Leu Leu 
245 250 255 

Val lie Thr Asp Gly Gin Lys Tyr Arg Asp Pro Leu Glu Tyr Ser Asp 
260 265 270 
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Val He Pro Ala Ala Asp Lys Ala Gly He He Arg Tyr Ala He Gly 
275 280 285 

Val Gly Asp Ala Phe Gin Glu Pro Thr Ala Leu Lys Glu Leu Asn Thr 
290 295 300 

He Gly Ser Ala Pro Pro Gin Asp His Val Phe Lys Val Gly Asn Phe 

310 315 ' 320 

Ala Ala Leu Arg Ser He Gin Arg Gin Leu Gin Glu Lys He Phe Ala 
325 330 

He Glu Gly Thr Gin Ser Arg Ser Ser Ser Ser Phe Gin His Glu Met 
340 345 250 

ser Cln Glu Gly Phe Ser Ser Ala Leu Thr Ser Asp Gly Pro Val Leu 
355 360 365 

^no ^^"^ S^*^ T^P Ser Gly Gly Ala Phe Leu Tyr Pro 

370 375 3QQ 

Pro Asn Thr Arg Pro Thr Phe He Asn Met Ser Gin Glu Asn Val Asp 

390 395 

Met Arg Asp Ser Tyr Leu Gly Tyr Ser Thr Ala Val Ala Phe Trp Lvs 
405 410 4if ' 

Gly Val His Ser Leu He Leu Gly Ala Pro Arg His Gin His Thr Gly 
420 425 430 ^ 

Lys Val val He Phe Thr Gin Glu Ala Arg His Trp Arg Pro Lys Ser 
4Jb 440 445 

Glu Val Arg Gly Thr Gin He Gly Ser Tyr Phe Gly Ala Ser Leu Cys 

455 460 

ser Val Asp Val Asp Arg Asp Gly Ser Xaa Asp Leu Val Leu He Gly 

470 475 480 

Ala Pro His Tyr Tyr Glu Gin Thr Arg Gly Gly Gin Val Ser Val Phe 
485 490 

Pro Val Pro Gly Val Arg Gly Arg Trp Gin Cys Glu Ala Thr Leu His 
500 505 sio 

Gly Glu Gin Gly His Pro Trp Gly Arg Phe Gly Val Ala Leu Thr Val 

520 525 

Leu Gly Asp Val Asn Gly Asp Asn Leu Ala Asp Val Ala He Gly Ala 

535 540 

Pro Gly Glu Glu Glu Ser Arg Gly Ala Val Tyr He Phe His Gly Ala 

550 555 5gQ 

Ser Arg Leu Glu He Met Pro Ser Pro Ser Gin Arg Val Thr Gly Ser 
565 570 57I 

Gin Leu Ser Leu Arg Leu Gin Tyr Phe Gly Gin Ser Leu Ser Gly Gly 
5°0 585 590 
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Gin Asp Leu Thr Gin Asp Gly Leu Val Asp Leu Ala Val Gly Ala Gin 
595 600 605 

Gly His Val Leu Leu Leu Arg Ser Leu Pro Leu Leu Lys Val Glu Leu 
610 615 620 

Ser lie Arg Phe Ala Pro Met Glu Val Ala Lys Ala Val Tyr Gin Cys 
625 630 635 640 

Trp Glu Arg Thr Pro Thr Val Leu Glu Ala Gly Glu Ala Thr Val Cys 
645 650 655 

Leu Thr Val Hie Lys Gly Ser Pro Asp Leu Leu Gly Asn Val Gin Gly 
660 665 670 

Ser Val Arg Tyr Asp Leu Ala Leu Asp Pro Gly Arg Leu lie Ser Arg 
675 680 685 

Ala lie Phe Asp Glu Thr Lys Asn Cys Thr Leu Thr Gly Arg Lys Thr 
690 695 700 

Leu Gly Lou Gly Asp His Cys Glu Thr Val Lys Leu Leu Leu Pro Asp 
705 710 715 720 

Cys Val Glu Asp Ala Val Ser Pro lie lie Leu Arg Leu Asn Phe Ser 
725 730 735 

Leu Val Arg Asp Ser Ala Ser Pro Arg Asn Leu His Pro Val Leu Ala 
740 745 750 

Val Gly Ser Gin Asp His lie Thr Ala Ser Leu Pro Phe Glu Lys Asn 
755 760 765 

Cys Lys Gin Glu Leu Leu Cys Glu Gly Asp Leu Gly lie Ser Phe Asn 
770 775 780 

Phe Ser Gly Leu Gin Val Leu Val Val Gly Gly Ser Pro Glu Leu Thr 
785 790 795 800 

Val Thr Val Thr Val Trp Asn Glu Gly Glu Asp Ser Tyr Gly Thr Leu 
805 810 815 

Val Lys Phe Tyr Tyr Pro Ala Gly Leu Ser Tyr Arg Arg Val Thr Gly 
820 825 830 

Thr Gin Gin Pro His Gin Tyr Pro Leu Arg Leu Ala Cys Glu Ala Glu 
835 840 845 

Pro Ala Ala Gin Glu Asp Leu Arg Ser Ser Ser Cys Ser lie Asn His 
850 855 860 

Pro lie Phe Arg Glu Gly Ala Lys Thr Thr Phe Met lie Thr Phe Asp 
865 870 875 880 

Val Ser Tyr Lys Ala Phe Leu Gly Asp Arg Leu Leu Leu Arg Ala Lys 
885 890 895 

Ala Ser Ser Glu Asn Asn Lys Pro Asp Thr Asn Lys Thr Ala Phe Gin 
900 905 910 
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Leu Glu Leu Pro Val Lys Tyr Thr Val Tyr Thr Leu lie Ser Ara Gin 
915 920 925 

Glu Asp Ser Thr Aon His Val Asn Phe Ser Ser Ser His Glv Glv Aro 
930 935 940 

Arg Gin Glu Ala Ala His Arg Tyr Arg Val Asn Asn Leu Ser Pro Leu 
945 950 955 950 

Lys Leu Ala Val Arg Val Asn Phe Trp Val Pro Val Leu Leu Asn GlV 
965 970 975 ^ 

Val Ala V_al Trp Asp Val Thr Leu Ser Ser Pro Ala Gin Gly Val Ser 
980 985 990 

eye Val Ser Gin Met Lys Pro Pro Gin Asn Pro Asp Phe Leu Thr Gin 

1000 1005 

Ala Asp Cys Leu His 
^"10 1015 1020 

Phe Arg Cys Asp He Pro Ser Leu Asp He Gin Asp Glu Leu Asp Phe 

i"0 1035 1040 

He Leu Arg Gly Asn Leu Ser Phe Gly Trp Val Ser Gin Thr Leu Gin 
1045 1050 1055 

Glu Lye Val Leu Leu Val Ser Glu Ala Glu He Thr Phe Asp Thr Ser 
1060 1065 1070 

Val Tyr fer Gin Leu Pro Gly Gin Glu Ala Phe Leu Arg Ala Gin Val 
1075 1080 1085 

^^"^ TnSn^*''' ^^"^ ^^"^ ^^"^ TX^ I^he Leu Val 

-^^^^ 1095 1100 

^inc^^^ ^^"^ 1*®^ Ala Leu He Thr Val 

^^^^ mo 1115 1120 

Val Leu Tyr Lys Leu Gly Xaa Xaa Lys Arg Gin Tyr Lys Glu Met Leu 
1125 1130 1135 

Asp Gly Lys Ala Ala Asp Pro Val Thr Xaa Gly Gin Ala Asp Phe Gly 
1140 1145 iilo 

Cys Glu Thr Pro Pro Tyr Leu Val Ser 
1155 1160 



(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 
CCTGTCATGG GTCTAACCTG 2 0 

(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

AGGTTAGACC CATGACAGG 19 

(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 
GGCCTTGCAG CTGGACAATG 20 
(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 
CCAAAGCTGG CTGCATCCTC TC 22 
(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 
CCGCCTGCCA CTGGCGTGTG C 
(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 
CCCAGATGAA CCACTTCCTC AA 
(2) INFORMATION FOR SEQ ID NO: 62: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 
GCTGGGATCA TTCGCTATGC 
(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCTE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 
CAATGGATGG ACCAGTTCTG G 
(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:64: 

CAGATCGGCT CCTACTTTGG 20 

(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 
CATGGAGCCT CGAGACAGG 19 
(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQXJENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 
CCACTGTCCT CGAAGCTGGA G 21 
(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 
CTTCGTCCTG TGCTGGCTGT GGGCTC 26 
(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 
CGCCTGGCAT GTGAGGCTGA G 
(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHT^RACTERISTICS : 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 
CCGTGATCAG TAGGCAGGAA G 
(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 
GTCACAGAGG GAACCTCC 

(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

( C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 
GCTCCTGAGT GAGGCTGAAA TCA 
(2) INFORMATION FOR SEQ ID NO: 72: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 
GAGATGCTGG ATCTACCATC TGC 23 
(2) INFORMATION FOR SEQ ID NO: 73: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

CTGAGCTGGG AGATTTTTAT GG 22 

(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 base pairs 
< B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ing le 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 
GTGGATCAGC ACTGAAATCT G 21 
(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 
CGTTTGAAGA AGCCAAGCTT G 21 
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(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:76: 
CACAGCGGAG GTGCAGGCAG 
(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:77: 
CTCACTGCTT GCGCTGGC 

(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 
CGGTAAGATA GCTCTGCTGG 
(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 
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GAGCCCACAG CCAGCACAGG 20 

(2) INFORMATION FOR SEQ ID NO: SO: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 
GATCCAACGC CAGATCATAC C 21 
(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : single 
(D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 
CACGGCCAGG TCCACCAGGC 20 

(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ingle 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 
CACGTCCCCT AGCACTGTCA G 21 
(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 
TTGACGAAGT CCTTCATCTG GG 
(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 
6AACTGCAAG CTGGAGCCCA G 
(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:85: 
CTGGATGCTG CGAAGTGCTA C 
(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:86: 
GCCTTGGAGC TGGACGATGG C 

(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



22 



21 



21 



21 
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<ii) MOLECULE TYPE: DNA 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 



GTAAGATCTC CAGAGTGTCC AAGACAAGAG ATG 



33 



(2) INFOBMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 

CTTCTCGAGT GTGAGAGCTG AACTGAAACC TTC 33 

<2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 32 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 
CGCTGTGACG TCAGAGTTGA GTCCAAATAT GG 32 
(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

( C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



4 



(Xi) 



SEQUENCE DESCRIPTION: SEQ ID NO: 90: 



GGTGACACTA TAGAATAGGG C 



21 



(2) INFORMATION FOR SEQ ID NO: 91: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 
AAGCAGGAGCTCCTGTGT 18 
(2) INFORMATION FOR SEQ ID NO: 92: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 852 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 61.-852 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 

TGATCTCCCT CCAGGCCACT GTTCCCTCTC CACTTCCCCT CACCGCTGCA CTGCTCAGAG 60 

ATG GCC CTT GGG GCT GTG GTC CTC CTT GGG GTC CTG GCT TCT TAC CAC 108 
Met Ala Leu Gly Ala Val Val Leu Leu Gly Val Leu Ala Ser Tyr His 
15 10 15 

GGA TTC AAC TTG GAC GTG ATG AGO GGT GAT CTT CCA G6A AGA CGC AGC 156 
Gly Phe Asn Leu Asp Val Met Ser Gly Asp Leu Pro Gly Arg Arg Ser 
20 25 30 

GGG CTT CGG GCA GAG CGT GAT GCA GTT TGG GGA TCT CGA CTC GTG GTG 204 
Gly Leu Arg Ala Glu Arg Asp Ala Val Trp Gly Ser Arg Leu Val Val 
35 40 45 

GGA GCC CCC CTG GCG GTG GTG TCG GCC AAC CAC ACA GGA CGG CTG TAC 252 
Gly Ala Pro Leu Ala Val Val Ser Ala Asn His Thr Gly Arg Leu Tyr 
50 55 60 

GAG TGT GCG CCT GCC TCC GGC ACC TGC ACG CCC ATT TTC CCA TTC ATG 300 
Glu Cys Ala Pro Ala Ser Gly Thr Cys Thr Pro He Phe Pro Phe Met 
65 70 75 80 

CCC CCC GAA GCC GTG AAC ATG TCC CTG GGC CTG TCC CTG GCA GCC TCC 348 
Pro Pro Glu Ala Val Asn Met Ser Leu Gly Leu Ser Leu .Ala Ala Ser 
85 90 95 

CCC AAC CAT TCC CAG CTG CTG GCT TGT GGC CCG ACC GTG CAT AGA GCC 396 
Pro Asn His Ser Gin Leu Leu Ala Cys Gly Pro Thr Val His Arg Ala 
100 105 110 
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TGC GGG GAG GAC GTG TAG GCC CAG GGT TTC TGT GTG CTG CTG GAT GCC 444 
Cye Gly Glu Asp Val Tyr Ala Gin Gly Phe Cys Val Leu Leu Asp Ala 
XX3 

CAC GCA CAG CCC ATC GGG ACT GTG CCA GCT GCC CTG CCC GAG TGC CCA 492 
His Ala Gin Pro lie Gly Thr Val Pro Ala Ala Leu Pro Glu Cys Pro 
130 135 140 

GAT CAA GAG ATG GAC ATT GTC TTC CTG ATT GAC GGC TCT GGC AGC ATT 540 
Asp Gin Glu Met Asp lie Val Phe Leu lie Asp Gly Ser Gly Ser lie 
145 150 155 160 

AGC TCA AAT GAC TTC CGC AAG ATG AAG GAC TTT GTC AGA GCT GTG ATG 588 
Ser Ser Asn Asp Phe Arg Lys Met Lys Asp Phe Val Arg Ala Val Met 
165 170 175 

GAC CAG TTC AAG CAC ACC AAC ACC CAG TTC TCG CTG ATG CAG TAG TGC 636 
Asp Gin Phe LyB Asp Thr Asn Thr Gin Phe Ser Leu Met Gin Tyr Ser 
ISO 185 190 

AAT GTG CTG CTG ACA CAT TTC ACC TTC AGC AGC TTC CGG AAC AGC TCC 684 
Asn Val Leu Val Thr Hio Phe Thr Phe Ser Ser Phe Arg Asn Ser Ser 
195 200 205 

AAT CCT CAG CCC CTA CTG GAG CCC ATT GTG CAG CTG ACA GGC CTC ACG 732 
Asn Pro Gin Gly Leu Val Glu Pro lie Val Gin Leu Thr Gly Leu Thr 
210 215 220 

TTC ACG GCC ACA GGC ATC CTG AAA GTG GTG ACA GAG CTG TTT CAA ACC 780 
Phe Thr Ala Thr Gly lie Leu Lys Val Val Thr Glu Leu Phe Gin Thr 
225 230 235 240 

AAG AAC GGG GCC CGC GAA AGT GCC AAG AAG ATC CTC ATC GTC ATC ACA 828 
Lys Asn Gly Ala Arg Glu Ser Ala Lys Lys lie Leu lie Val lie Thr 
245 250 255 

GAT GGG CAG AAG TAC AAA GCG GCA 852 
Asp Gly Gin Lys Tyr Lys Ala Ala 
260 



(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 264 amino acids 

( B ) TYPE : amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 

Met Ala Leu Gly Ala Val Val Leu Leu Gly Val Leu Ala Ser Tyr His 
15 10 15 

Gly Phe Asn Leu Asp Val Met Ser Gly Asp Leu Pro Gly Arg Arg Ser 
20 25 30 

Gly Leu Arg Ala Glu Arg Asp Ala Val Trp Gly Ser Arg Leu Val Val 
35 40 45 
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Gly Ala Pro Leu Ala Val Val Ser Ala Asn Hie Thr Gly Arg Leu Tyr 
50 55 60 

Glu Cys Ala Pro Ala Ser Gly Thr Cys Thr Pro He Phe Pro Phe Met 
65 70 75 80 

Pro Pro Glu Ala Val Asn Met Ser Leu Gly Leu Ser Leu Ala Ala Ser 
85 90 95 

Pro Asn His Ser Gin Leu Leu Ala Cys Gly Pro Thr Val Hie Arg Ala 
100 105 110 

Cys Gly Glu Asp Val Tyr Ala Gin Gly Phe Cys Val Leu Leu Asp Ala 
115 120 125 

His Ala Gin Pro lie Gly Thr Val Pro Ala Ala Leu Pro Glu Cys Pro 
130 135 140 

Asp Gin Glu Met Asp lie Val Phe Leu lie Asp Gly Ser Gly Ser lie 

150 155 160 

Ser Ser Asn Asp Phe Arg Lys Met Lys Asp Phe Val Arg Ala Val Met 
165 170 175 

Asp Gin Phe Lys Asp Thr Asn Thr Gin Phe Ser Leu Met Gin Tyr Ser 
180 185 190 

Asn Val Leu Val Thr His Phe Thr Phe Ser Ser Phe Arg Asn Ser Ser 
195 200 205 

Asn Pro Cln Gly Leu Val Glu Pro lie Val Gin Leu Thr Gly Leu Thr 
210 215 220 

Phe Thr Ala Thr Gly He Leu Lys Val Val Thr Glu Leu Phe Gin Thr 
225 230 235 240 

Lys Asn Gly Ala Arg Glu Ser Ala Lys Lys He Leu He Val He Thr 
245 250 255 

Asp Gly Gin Lys Tyr Lys Ala Ala 
260 
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WHAT IS CLAIMED IS: 

1. A purified and isolated polynucleotide consisting essentially 
of human protein coding sequence set out in SEQ ID NO: 1. 

2. The polynucleotide of claim 1 which is a DNA molecule. 

3. The DNA molecule of claim 2 which is a cDNA molecule. 

4. The DNA molecule of claim 2 which is a genomic DNA 

molecule. 

5. The DNA molecule of claim 2 which is a wholly or partially 
chemically synthesized DNA molecule. 

6. A full length purified and isolated ajj-encoding polynucleotide 
selected from the group consisting of: 

a) the human DNA sequence set out in SEQ ID NO: 1, and 

b) a DNA molecule which hybridizes under stringent conditions 
to the noncoding strand of the protein coding portion of the DNA of a). 

7. A DNA molecule encoding the human amino acid sequence 
set out in SEQ ID NO: 2. 

8. A DNA expression construct comprising a DNA molecule 
according to claim 2. 

9. A host cell transformed with a DNA molecule according to 

claim 2. 
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10. A method for producing an polypeptide comprising growing 
a host cell according to claim 9 in a suitable medium and isolating polypeptide 
from said host cell or the medium of its growth. 

11. Purified and isolated polypeptide consisting essentially of 
the human amino acid sequence set out in SEQ ID NO: 2. 

12. A polypeptide capable of specifically binding to a^. 

13. A polypeptide according to claim 12 which is an antibody. 

14. An antibody according to claim 13 which is a monoclonal 



antibody. 



of claim 14. 



15. An anti-idiotype antibody specific for the monoclonal antibody 



16. A hybridoma cell line producing the monoclonal antibody 
according to claim 14. 



17. A purified and isolated extracellular domain polypeptide 
fragment comprising amino acids 17 to 1108 of the human amino acid 
sequence set out in SEQ ID NO: 2. 

18. A purified and isolated I domain polypeptide fragment 
comprising amino acids 145 to 355 of the human amino acid sequence set out 
in SEQ ID NO: 2. 
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19. A fusion protein comprising extracellular domain 
polypeptide amino acids 17 to 1 108 of SEQ ID NO: 2 and human immunoglobulin 
constant domain sequences. 

20. A purified and isolated murine polynucleotide consisting 
essentially of the a subunit protein coding sequence set out in SEQ ID NO: 45. 

21. A method for isolating a polynucleotide encoding a protein that 
binds to comprising the steps of: 

a) transforming or transfecting appropriate host cells with a DNA 
construct comprising a reporter gene under the control of a promoter regulated by 
a transcription factor having a DNA-binding domain and an activating domain; 

b) expressing in said host cells a first hybrid DNA sequence 
encoding a first fusion of part or all of and either the DNA binding domain or 
the activating domain of said transcription factor; 

c) expressing in said host cells a library of second hybrid DNA 
sequences encoding second fusions of part or all of putative binding proteins 
and the DNA binding domain or activating domain of said transcription factor 
which is not incorporated in said first fusion; 

d) detecting binding of an binding protein to in a particular 
host cell by detecting the production of reporter gene product in said host cell; 
and 

e) isolating second hybrid DNA sequences encoding binding 
protein from said particular host cell. 

22. A method for identifying a compound capable of reacting 
specifically with and of modulating the interaction of binding partners and 
ICAM-R comprising the steps of: 
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a) immobilizing or a fragment thereof, or ICAM-R or a 
fragment thereof, on a solid support coated or impregnated with a fluorescent 
agent; 

b) labelling the non-immobilized binding partner with a compound 
capable of exciting said fluorescent agent; 

c) contacting said immobilized binding partner with said labelled 
binding partner in the presence and absence of a putative modulator compound 
capable of specifically reacting with a^; 

d) detecting light emission by said fluorescent agent; and 

e) identifying modulating compounds as those compounds that affect 
the emission of light by said fluorescent agent in comparison to the emission of 
light by said fluorescent agent in the absence of said modulating compound. 

23. A purified and isolated or^ extracellular domain polypeptide 
fragment comprising about amino acid 127 to about amino acid 353 of the human 

amino acid sequence set out in SEQ ID NO: 2. 

24. A fusion protein comprising the polypeptide fragment of claim 
23 and human immunoglobulin constant region sequences. 

25. A purified and isolated extracellular domain polypeptide 
fragment comprising about amino acid 17 to about amino acid 603 of the human 

amino acid sequence set out in SEQ ID NO: 2. 

26. A fusion protein comprising the polypeptide fragment of claim 
25 and human immunoglobulin constant region sequences. 



wo 95/17412 



1/4 



PCT/US94/14832 



^ ^ LO ' 



vo r-^ 00 



cr» o> 9> 



CM CM CM 



O* C3^ 
CM CM CM 



=eoe£ ^lA-Lk oeetffi: ^t/tv^ 

(/)«/>(/> ^^<c oc^u zujz oc9Ci0 euujoe 

UJV9UJ (/)</>(/) ZZ>- Ci9 (/> Z 

OCX H-(/)t/) c^tt0 >'^^ ^H-(/) ooeo 

i^^>- -j^-j H- — J fie: a o a u. u- u. 

oao U9csci9 zzx 

>>> ^^.iJ OUI> IkLkU. SO-J 

>>> (/)(/)(/) Q.^a. <(/)(/) C9C9C9 

(/)(/)(/^ ZZZ >U.-J OUItt >>> 

OOO ZZZ H-^Oe -J — I O h-OCZ C9C9C9 

U.U.bu I-IO.I- 0&(/) >?^> 0< 

OC0C9 UJUJUJ H-lO-J H-</)0. H-^Z >->->• 

uoe< a.>a. ujo O cs^oe — loeo octtotf 

Og^eL i UJ^O l-4l-^>-H 1-41-11-^ 

QZO >-4>> I I Lb^U. C9C9< l-4>l-l 

UJIU> zoo OeZf— OO0 I— H-H- V9V9«0 

OOQC -J^^ (/)</)0. C0UJ(/) <<< •^UJ< 

U.U.U. 0.0^0 C0(9o I— :)^oe< 

KHh;< MI-HH^ «^U.^ >>> LkS>- UiOO 

oZS!Ij oSjSj ijkZfiif StoS ^fllstz SSS 

LUZUJ OCJCJ UCJO >>> O<3C0 0.^0. 

>h-^ oo<3 tfcsu oujz .j^^ >>> 

OOO H-f-i— ^^1-! 

-Ji^^ (/)>-J zzS >»»s (/)ujZ 

ZZZ >->->- OOCh- i-ll-H> >->->- 

tkU.U. <OC9 (/>l— Z ZOS< 0.&(/) UlOO 

C^CJU ZZZ U.U.U. o^«c ^-J^ 

ss-j OOO uiuioe ooz >>.j o.o.(/) 

>-0</) >->->- 0(/)0 ZSOtf J-J OOO 

(/)^H- .iJ— i-J OOO 00.(/) (^(/)(/) Scstf 

2h- < ac</io >H-uj o HI CO aae— J >-u.uj 

^iiS csvso eeoz i-ii-ihH oo.^ 

x< ^-oe»-i ssz t/)t/)(/) (/)zz ouj^ 

h- OOO OOO 0.0^(0 OOO 

^-JU. ZZZ h-H- (/>(/>(/) inZ^ OOO 

— 1^ i-i ^^O. OOO h-ZQg H-H- H- 

> OOO OOO oeott i-ii-Hi-4 

I I «C > l-H M «£«£<; _1 — J -J OUJUJ l-4>»-4 

I I ^ UJUJ^ U.U.kk h- >«SUJ 

H-oe« .joo ^-J— J >o u.u.Lk MH-ii-H 

o^H- o-^o. oeoo »-4i-i>-4 

u. < h- OOO 00.0. Z(/)0 zaeae <<< 

h- zae >>> z</i(/) uiouj i-4H-«^ u)zo 

GQ U CO U 00 U OQ U CBU OQ U 

OOO OOO OOO OOO OOO OOO 

aoo OOO »oo eoo aoo «oo 

BNSDOCID: <WO_9617412A1J_> 



o 



wo 95/17412 PCT/US94/14832 

2/4 

S£^S iSE^S u>i^r^ M9r^f^ 

^^S^ SSiS^ 222 2S»^ o^o^c> 

mcnen rororo ^^^r ^^s-^ inunun iSiuiui 



>(/)> 

UJ > LU 



CO OO i/) 

c/></> go 

SSS 111 Si3 §§§ ill 

SuiS 555^52 



LU UJ UJ 



Si 



U. >- 
C9 



C9(/><0 



</)(/) i/) 
u. u. u. 



sis 

C9 C9C0 

»— 



C9C0C9 



O O O U. Lu U. 

UJ o CP >.>->: 



wo OO CO 



C0 



333 ^s^^ 

-J-J^ CO 
-J-J-J CO CD C9 



CO (/) to 
C9C9C9 



COC/)(/) 

c^ c^c^ 



CO 

s 



3S3 



(/)(/)(/) 



C9C9C9 
OOO 
UJUJUJ 



Hi 

003 



«/)</>(/) 

Nl l-H 

CO (/) Ol. 
UJC9(0 
(/)(/> -J 



ooS 

UJ O UJ 



SS r2l2 •^'H 

oSS OOO OOO OOO oaS -'^'^ 
cs«^o eow oSS «So 



■HtH 



CNSOOCID: <WO gs17412A1 I > 



J 



wo 95/17412 PCT/US94/14832 



- 163 - 

27. A purified and isolated extracellular domain polypeptide 
fragment comprising about amino acid 17 to about about amino acid 1029 of the 
human amino acid sequence set out in SEQ ID NO: 2, 

28- A fusion protein comprising the polypeptide fragment of claim 
27 and immunoglobulin constant region sequences. 

29. A purified and isolated murine polynucleotide comprising the 
a subunit protein coding sequence as set out in SEQ ID NO: 52. 

30. A purified and isolated polypeptide consisting essentially 
of the murine amino acid sequence set out in SEQ ID NO: 53. 

31. A purified and isolated rat polynucleotide comprising the ci 
subunit protein coding sequence as set out in SEQ ID NO: 54. 

32. A purified and isolated polypeptide consisting essentially 
of the rat a^j amino acid sequence set out in SEQ ID NO: 55. 

33. A purified and isolated polypeptide fragment comprising 
extracellular domain sequences of the polypeptide of claim 32. 

34 . A polypeptide capable of specifically binding to the 
polypeptide of claim 32. 

35. A polypeptide according to claim 34 which is an antibody. 

36. An antibody according to claim 35 which is a monoclonal 

antibody. 
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37. A rodent that does not express a functional protein, 

38. A rodent that expresses a variant aj protein. 
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